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Method for producing a nucleotide sequence construct with optimised codons for an HIV 
genetic vaccine based on a primary, early HIV isolate and synthetic envelope BX08 
constructs. 



5 Field of the invention 

The invention relates to a DNA vaccine against HIV, which is designed from a clinical 
primary isolate. One aspect of the invention relates to a method of producing a nucleotide 
sequence construct, in a prefered aspect based on a cassette system, the nucleotide 
sequence construct being used as a DNA vaccine. The method can, for example, lead to the 
10 disclosed synthetic BX08 HIV-1 envelope vaccine nucleotide sequence construct, designed 
to generate suitable DNA vaccines against HIV, specifically HIV-1. Furthermore, the 
invention can be used for the production of recombinant protein antigens. 



Background of the invention 

15 There is an urgent need for new vaccine strategies against HIV. One such new promising 
strategy is called genetic immunisation or DNA vaccine (Webster et al 1997). Some of the 
advantages of a DNA vaccine against HIV is the induction of Th cell activation, induction of 
antibodies also against conformational dependent epitopes, and the induction of cellular 
immunity. So far, most DNA vaccine envelope genes tried, have been from tissue culture 

20 adapted virus strains (Boyer et al 1997) that often differs in several aspects from primary 
clinical isolates (such as early isolates) e.g. in co-receptor usage (Choe et al 1996, Dragicet 
al1997). 

One disadvantage in HIV envelope based DNA vaccines may be the intrinsic relatively low 
25 expression which is regulated by the Rev expression. This may prevent an optimal 
investigation of the vaccines in small animal models like mice where Rev is functioning 
suboptimally. Recently it has been shown using the tissue culture adapted HIV-1 MN strain, 
that an exchange of the HIV codon usage to that of highly expressed mammalian genes 
greatly improves the expression in mammalian cell lines and renders the HIV expression Rev 
30 independent (Haas et al 1996). Additionally, it is known that rare codons cause pausing of 
the ribosome, which leads to a failure in completing the nascent polypeptide chain and a 
uncoupling of transcription and translation. Pausing of the ribosome is thought to lead to 
exposure of the 3' end of the mRNA to cellular ribonucleases. 
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The world-wide spread of HIV-1 has presently resulted in 8,500 new infections daily and 
AIDS is now the number 1 cause of death among US males (and number 3 among US 
females) aged 25-40 years. The epidemic hot-spots now include Eastern Europe, India and 
5 South East Asia and southern Africa. The attempts to soive this world-wide problem involve 
education, prevention, treatment and vaccine development. Affordable protective vaccines 
represent the best solution to the world-wide problem of infection with HIV-1 . Induction of 
virus neutralising antibodies is one of the key components in vaccine development. Several 
recombinant envelope vaccines have been tested in humans and animals, however, they 

10 seem unable to induce sufficient protection. In this respect DNA vaccination may provide a 
different and more natural mode of antigen presentation. It is hoped that the immune 
responses induced by such DNA vaccines could aid in limiting virus replication, slowing 
disease progression or preventing occurrence of disease. Unfortunately many HIV envelope 
vaccines induce only moderate levels of antibodies. This could in part be due to limitations in 

15 expression, influenced by regulation by the Rev protein and by a species-specific and biased 
HIV codon usage. Also the virus variability is considered a barrier for development of 
antibody based vaccines and thus a tool for more easy producing of closely related vaccine 
variants is needed. 

20 It has been suggested to improve the immunogenicity and antigenicity of epitopes by certain 
mutations in the envelope gene. An elimination of certain immune dominant epitopes (like 
V3) could render less immune dominant but more relevant, conserved, or hidden epitopes 
more immunogenic (Bryder et al 1999). Also elimination of certain N-linked glycosylation 
sites could improve the exposure of relevant epitopes and increase the immunogenicity of 

25 those epitopes. Thus, it is possible that elimination of the glycosylation sites in V1 and V2 
may in a more favourable way expose neutralising epitopes (Kwong et al 1998, Wyatt et al 
1998). The HIV envelope contains putative internalisation sequences in the intracellular part 
of gp41 (Sauter et al 1996). Thus it would be relevant to eliminate and/or mutate the 
internalisation signals in a membrane bound HIV envelope vaccine gene to increase the 

30 amount of surface exposed vaccine derived HIV glycoproteins as gp150. Since the antibody 
response, that is measured and calculated in titers, is improved by adding the secreted 
gp120 as opposed to adding the membrane bound form (Vinner et al 1999), it could be 
advantageous to express the vaccine as a secreted gp120 or a secreted gp140. This would 
include important parts of gp41 , such as the 2F5 neutralising linear epitope (Mascola et al 

35 1997). 
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Summary of the invention 

Our suggested solution to the problems described above is to design DNA envelope 
vaccines from a clinical primary isolate with Rev-independent high expression in mammals, 
that is built as a cassette for easy variant vaccine production. 

5 

A method of producing a nucleotide sequence construct with codons from highly expressed 
mammalian proteins based on a cassette system coding for an early, primary HIV envelope 
is described. The method comprises the steps of direct cloning of an HIV gene, derived from 
a patient within the first 12 months of infection, thereby obtaining a first nucleotide sequence; 

10 designing a second nucleotide sequence utilising the most frequent codons from mammalian 
highly expressed proteins to encode the same amino acid sequence as the first nucleotide 
sequence; redesigning the second nucleotide sequence so that restriction enzyme sites 
surround the regions of the nucleotide sequence encoding functional regions of the amino 
acid sequence and so that selected restriction enzyme sites are removed, thereby obtaining 

15 a third nucleotide sequence encoding the same amino acid sequence as the first and the 
second nucleotide sequence; redesigning the third nucleotide sequence so that the terminals 
contain convenient restriction enzyme sites for cloning into an expression vehicle; producing 
snuts between restriction enzyme sites as well as terminal snuts and introducing snuts into 
an expression vehicle by ligation. The nucleotide sequence construct obtained by this 

20 method uses the mammalian highly expressed codons (figure 1) and renders the envelope 
gene expression Rev independent and allows easy cassette exchange of regions 
surrounded by restriction enzyme sites that are important for immunogenicity, function, and 
expression. 

25 The method can, for example, lead to the disclosed synthetic, Rev-independent, clinical 
(such as early), primary HIV-1 envelope vaccine gene, built as a multi cassette. From the 
sequence of the envelope of the HIV-1 BX08 isolate (personal communication from Marc 
Girard, Institute Pasteur, Paris), the present inventors have designed a synthetic BX08 HIV-1 
envelope vaccine nucleotide sequence construct. 

30 

With the great diversity of envelopes in HIV among different patients and within one patient, 
it would be of advantage to vaccinate with several envelope variants, all being highly 
expressed. To avoid synthesising several full length envelopes, it is much easier to 
exchange relevant parts of an envelope cassette to various strains in a multivalent vaccine. 

35 
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Whether it is the disclosed synthetic BX08 nucleotide sequence construct, or any of the 
nucleotide sequence constructs obtained by the method, they are designed to generate 
suitable DNA vaccines against HIV, specifically HIV-1. In this case the mammal, preferably a 
human being, is inoculated with the nucleotide sequence construct in an expression vehicle 
5 and constitutes a host for the transcription and translation of the nucleotide sequence 
construct. The nucleotide sequence constructs of the present invention can furthermore be 
used for the production of recombinant protein antigens. In this case the nucleotide 
sequence construct is placed in an expression vehicle and introduced into a system (e.g. a 
cell-line), allowing production of a recombinant protein with the same amino acid sequence. 
10 The recombinant protein is then isolated and administered to the mammal, preferably a 
human being. The immune system of the mammal will then direct antibodies against 
epitopes on the recombinant protein. The mammal, preferably a human being, can thus be 
primed or boosted with DNA and/or recombinant protein obtained by the method of the 
invention. 

15 

A relevant HIV DNA vaccine can potentially be used not only as a prophylactic vaccine, but 
also as a therapeutic vaccine in HIV infected patients, e.g. during antiviral therapy. An HIV 
specific DNA vaccine will have the possibility to induce or re-induce the wanted specific 
immunity and help the antiviral therapy in limiting or even eliminating the HIV infection. The 

20 immunogenicity and antigenicity of epitopes in the envelope can be improved by certain 
mutations in the envelope gene. The cassette system allows for easy access to the relevant 
parts of the envelope gene, and thereby eased efforts in the process of genetic manipulation. 
Some suggested mutations are: an elimination of certain immune dominant epitopes (like 
V3); elimination of certain N-linked glycosylation sites (like glycosylation sites around V2); 

25 elimination and/or mutation of the nucleotide sequence encoding the internalisation signals in 
the cytoplasmic part of a membrane bound HIV envelope to increase the amount of surface 
exposed vaccine derived HIV glycoproteins; elimination or mutation of the cleavage site 
between gp120 and gp41; with introduced mutations in gp41 for preserving conformational 
epitopes. 

30 

Table 1 below, lists the nucleotide sequence constructs of the invention by the names used 
herein, as well as by reference to relevant SEQ ID NOs of DNA sequences, and the amino 
acid sequence encoded by the DNA sequence in the preferred reading frame. It should be 
noted, that the snut name consist of the number of the approximate position for the end of 
35 the snut and the restriction enzyme used to cleave and/or iigate that end of the snut. 
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Table 1 List of names of nucleotide sequence constructs (Snuts (S) and Pieces (P)) 
with reference to SEQ ID NO for the nucleotide sequence and protein sequence. 



Name 


Nucleotide SEQ ID 


Protein SEQ ID 




NO: 


NO: 


So-N-Lang 


1 


2 


S235EcoRV 


3 


4 


S375Pstl 


5 


6 


S495CI8I 


7 


8 


Se50-720EcoRI 


9 


10 


S&00Xbal 


11 


12 


SwoSact 


13 


14 


Smospei 


15 


16 


Si265Xhol 


17 


18 


Si265Qp120 


19 


20 


Si266gp160 


21 


22 


Si465Pstl 


23 


24 


Si465Pstl cys 


25 


26 


Si630Xbal 


27 


28 


SuoOEagl 


29 


30 


Si890Hindlll 


31 


32 


S2060Sac(l 


33 


34 


S210OCIal 


35 


36 


S2330Pstl 


37 


38 


S2425ES 


39 


40 


Pi 


41 


42 


P 2 


43 


44 


P 3 


45 


46 


P3GVI 


47 


48 


P3 GV1V2 


49 


50 


P3GV2 


51 


52 


P4gp160 


53 


54 


P4gp150 


55 


56 


P4gp140 


57 


58 


Ps 


59 


60 


Psgp160 


61 


62 


P8gp150 


63 


64 


Pegp140 


65 


66 


synBX08-140 


67 


68 


synBX08-150 


69 


70 


synBX08-160 


71 


72 


synBX08-120 


73 


74 


synBX08-41 


75 


76 
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Detailed disclosure of the invention 

One aspect of the present invention relates to a method for producing a nucleotide sequence 
construct coding for an HIV gene. The nucleotide sequence construct is produced as a 
cassette system consisting of snuts. A snut (S) is a nucleotide sequences construct between 
5 restriction enzyme cleavage sites comprising the minimal entity of the cassette system. 

First an HIV gene is obtained from a patient within the first 12 months of infection. The term 
HIV should be understood in the broadest sense and include HIV 1 and HIV 2. It is possible 
to determine the period in which the infection has taken place with an accuracy depending 

10 on the frequency of the blood tests taken from the patient. For example, patients suffering 
from various diseases such as lack of certain factors in their blood or hepatitis have their 
blood tested on a regular basis making it possible to determine the period in which the 
infection has taken place. Apart from patients with diseases wherein blood tests are used to 
monitor the course of the disease, other groups of patients have blood tests taken, e.g. blood 

15 donors. Unfortunately, humans are still infected due to transfer of virus in blood samples, 
medical equipment, etc., making it possible to determine the date where the infection has 
taken place within the time frame of a few days. The importance of obtaining the virus early 
in the course of the infection is due to the known fact that many early isolates share the 
common feature of staying relatively constant in their envelope sequences (Karlsson et a!., 

20 1998). As these early isolates may share cross-reactive antibody- and/or T-cell epitopes a 
vaccine based on such early isolates would have a better chance of inducing immune 
response to shared epitopes of the virus, it is believed that an early, directly cloned virus 
isolate will share immunogenic sites with other early virus isolates seen during an HIV 
infection, so that if a mammal generates antibodies and/or T-cells directed against these 

25 epitopes, the transferred virus will be eliminated prior to the extensive mutations that may 
occur after approximately 12 months of infection. Thus, the virus should be isolated as early 
as possible, that is within the first 12 months of infection, such as 1 1, 10, 9, 8, 7, 6, 5, 4, 3, 2, 
1 , or 0.5 month after infection. 

30 The HIV gene for genetic vaccine is preferably cloned directly from viral RNA or from proviral 
DNA. Direct cloning in this application stands for the virus not being multiplied in stable cell 
lines in vitro. It is presently expected that passing the virus through a stable cell line will 
promote mutation in the virus gene. It is particularly preferred not to pass the virus through 
cells lines selecting for viruses with CXCR4 receptor usage. Direct cloning also includes 

35 multiplication of virus in e.g. PBMC (peripheral blood mononuclear cells) since all virus can 
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multiply in PBMC, and this type of multiplication generally does not select for CXCR4 
receptor usage. Multiplication of virus is often necessary prior to cloning. Preferably cloning 
is performed directly on samples from the patient. In one embodiment of the invention, 
cloning is performed from patient serum. The cloning is then performed directly on the HIV 
5 virus, that is on RNA. In another embodiment of the invention cloning, is performed from 
infected cells. The cloning is then performed on HIV virus incorporated in the genes in an 
infected cell (e.g. a lymphocyte), that is on DNA. In the latter case the virus might be a silent 
virus, that is a non-replicating virus. To evaluate if the virus is silent, capability of 
multiplication in e.g. PBMC is tested. 

10 

Cloning is a technique well known to a person skilled in the art. A first nucleotide sequence is 
hereby obtained. In another aspect of the invention, the first nucleotide sequence, sharing 
the properties mentioned with direct cloning, is obtained by other means. This could be from 
a database of primary isolates or the like. 

15 

Based on the first nucleotide sequence, the amino acid sequence encoded by said 
nucleotide sequence is determined. A second nucleotide sequence encoding the same 
amino acid sequence is then designed utilising the most frequent codons from highly 
expressed proteins in mammalians (e.g. figure 1 presenting the most frequent codons from 
20 highly expressed proteins in humans). 

Presently, it appears that the usage of the most frequent codons from mammalian highly 
expressed proteins has two advantages: 1) the expression is Rev independent; 2) the level 
of expression is high. The Rev independence is especially advantageous when performing 

25 experiments in mice where the Rev systems is functioning sub-optimally. For the use in 
human vaccine, Rev independence and high expression are important to increase the 
amount of antigen produced. The determination of the codons for high expression is in this 
context based on the statistics from human highly expressed proteins (Haas, Park and Seed, 
1996 hereby incorporated by reference). It is contemplated that the expression of a protein 

30 can be even higher, when current research in binding between codon (on the mRNA) and 
anticodon (on the tRNA) reveals codons with optimal binding capabilities, and when 
interactions in-between codons and/or in-between anticodons are known. 

The second nucleotide sequence designed utilising optimised codons is then redesigned to 
35 obtain a third nucleotide sequence. The purpose of the redesigning is to create unique 
restriction enzyme sites around the nucleotide sequence encoding functional regions of the 
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amino acid sequence. By having unique restriction enzyme sites around the nucleotide 
sequence encoding functional regions of the amino acid sequence, the nucleotide sequence 
encoding functional regions of the amino acid sequence can easily be isolated, changed, and 
re-inserted. Examples of functional regions of the amino acid sequence are transmembrane 
5 spanning regions, immunodominant regions, regions with antibody cross reacting domains, 
fusion domains and other regions important for immunogenicity and expression such as 
variable region 1 (V1), variable region 2 (V2), variable region 3 (V3), variable region 4 (V4) 
and variable region 5 (V5). 

10 It is important to select the restriction enzymes sites with care. By changing the second 
nucleotide sequence to insert restriction enzyme sites around the nucleotide sequence 
encoding functional regions of the amino acid sequence, the third nucleotide sequence must 
still code for the same amino acid sequence as the second and first nucleotide sequence do. 
Thus, if necessary, the second nucleotide sequence is redesigned by changing from 

15 optimised codons to less optimal codons. It is understood, that the restriction enzyme sites 
around the nucleotide sequence encoding functional regions of the amino acid sequence 
should preferably be placed in the terminal region of the nucleotide sequence encoding 
functional regions of the amino acid sequence. That is preferably outside the nucleotide 
sequence encoding functional regions of the amino acid sequence, such as 90 nucleotides 

20 away, e.g. 81, 72, 63, 54, 45, 36, 27, 21, 18, 15, 12, 9, 6,3 nucleotides away, but could also 
be inside the nucleotide sequence encoding functional regions of the amino acid sequence, 
such as 54, 45, 36, 27, 21, 18, 15, 12, 9, 6, 3 nucleotides inside the nucleotide sequence 
encoding the functional region of the amino acid sequence. 

25 The type of restriction enzyme sites allowed is determined by the choice of expression 

vector. In certain cases, the number of restriction enzyme sites is limited and it is hard, if not 
impossible, to place unique restriction enzyme sites around all the nucleotide sequences 
coding for functional regions of the amino acid sequence. This problem can be solved by 
dividing the entire nucleotide sequence into pieces, so that each piece comprises only 

30 unique restriction enzyme sites. Modifications to each of the piece is performed separately 
prior to assembly of the pieces. It is preferred that the nucleotide sequence is divided into 9 
pieces. In another aspect, the nuclotide sequence is divided into 8 pieces, or 7, or 6, or 5, or 
4, or 3, or 2 pieces. It is especially preferred that the nucleotide sequence is divided into 3 
pieces. 



35 
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Thus, the redesign of the second nucleotide sequence is an interaction between the choice 
of cloning vector, expression vector, selection of restriction enzyme sites, dividing into 
pieces, and exchange of codons to insert restriction enzyme sites. In a preferred 
embodiment of the present invention the cloning vector is Bluescript allowing the restriction 
5 enzyme sites chosen from the group consisting of: £agl, M/ul, EcoRV, Ps/I, C/al, EcoRI, 
Xba\ t Sac\, Spel, Xho\, H/ncflll, Sacll, Not\ t BamH\ t Sma\, Sa/I, Oral, Kpn\. If other cloning 
vectors are chosen, other restriction enzyme sites will be available as known by the person 
skilled in the art. 

10 As a part of the redesigning of the second nucleotide sequence, selected restriction enzyme 
sites may be removed. The selected restriction enzyme sites to be removed are those sites 
that are sites of the same type as the ones already chosen above and that are placed within 
the same piece. The removal of these restriction enzyme sites is performed by changing 
from optimised codons to less optimal codons, maintaining codons for the same amino acid 

15 sequence. 

The third nucleotide sequence is redesigned so that the terminal snuts contain convenient 
restriction enzyme sites for cloning into an expression vehicle. The expression "vehicle" 
means any nucleotide molecule e.g. a DNA molecule, derived e.g. from a plasmid, 

20 bacteriophage, or mammalian or insect virus, into which fragments of nucleic acid may be 
inserted or cloned. An expression vehicle will contain one or more unique restriction enzyme 
sites and may be capable of autonomous replication in a defined host or vehicle organism 
such that the cloned sequence is produced. The expression vehicle is an autonomous 
element capable of directing the synthesis of a protein. Examples of expression vehicles are 

25 mammalian plasmids and viruses, tag containing vectors and viral vectors such as 

adenovirus, vaccinia ankara, adenoassociated virus, cannarypox virus, simliki forest virus 
(sfv), Modified Vaccinia Virus Ankara (MVA), and simbis virus. In one embodiment of the 
invention, the expression vector contains tag sequences. In another embodiment of the 
invention a bacteria is transformed with an expression plasmid vector and the bacteria is 

30 then delivered to the patient. Preferred expression vehicles are simliki forest virus (sfv), 
adenovirus and Modified Vaccinia Virus Ankara (MVA). 

The snuts are produced by techniques well known by the person skilled in the art. The 
preferred method for synthesising snuts, is herein referred to as "the minigene approach" 
35 wherein complementary nucleotide strands are synthesised with specific overhanging 
sequences for annealing and subsequent ligation into a vector. This can be performed with 
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two sets of complementary nucleotide strands, or with three sets of complementary 
nucleotide strands. The minigene approach minimises the known PCR errors of mismatches 
and/or deletions, which may occur due to hairpins in a GC rich gene with mammalian highly 
expressed codons. In figures 10-21, the production of a representative selection of snuts is 
5 illustrated. 

For the production of long snuts, that is snuts of more than about 240 nucleotides, the 
technique of overlapping PCR is preferred as illustrated in figure 8. Herein two nucleotide 
strands about 130 nucleotides long with an overlap are filled to obtain a double strand, which 
10 is subsequently amplified by PCR. 

For the production of multiple snuts with a length of less than about 210 nucleotides, one 
preferred technique is normal PCR. In a preferred production technique the snuts are 
synthesised with the same 5* flanking sequences and with the same 3' flanking sequences, 
15 as illustrated in figure 9. The advantages of this approach is, that the same PCR primer set 
can be used for amplification of several different snuts. 

As known by the person skilled in the art, special conditions have to be used for each 
individual PCR reaction and it should be optimised to avoid inherent problems like deletions 
20 mismatches when amplificating GC rich genes from synthetic ssDNA material. Whichever of 
the above mentioned techniques are used, it is well known by the person skilled in the art, 
that it will be necessary to correct unavoidable mismatches produced either due to the 
nucleotide strand synthesis material and/or the PCR reaction. This can be performed by site 
directed mutagenesis techniques. 

25 

After the various snuts have been produced, they are assembled into pieces and 
subsequently into the complete gene. Methods for assembly (such as ligation) are well 
known by the person skilled in the art. 

30 

In a preferred embodiment of the present invention the HIV gene encodes the entire HIV 
envelope. It is understood that the HIV envelope can be the full length envelope gp160 as 
well as shorter versions such as gp150, gp140, and gp120 with or without parts of gp41. 

35 As will be known by the person skilled in the art, the HIV is divided into several groups. 
These groups presently include group M, group O, and group N. Further, the HIV is divided 
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into subtypes A, B, C ( D, E, F, G ( H, I, and J. In the present invention subtype B is preferred 
due to the high prevalence of this subtype in the Western countries. 

The determination of groups and subtypes is based on the degree of nucleotide sequence 
5 identity in the envelope gene is presently defined as follows: If the sequence identity is more 
than 90% the viruses belong to the same subtype; If the sequence identity is between 80% 
and 90% the viruses belong to the same group. If the sequence identity is less than 80% the 
viruses are considered as belonging to different groups. 

10 One aspect of the invention relates to a nucleotide sequence construct in isolated form which 
has a nucleotide sequence with the general formula (I), (II), (III), or (IV) 

(I) PrS495ClarS650-720EcoRrP2-Si265gp120 

(II) PrS 4 95ClarSe50-720EcoRrP2-Si265Xhor Si465Pstr P4gp140 

(III) PrS 4 95CtarS650-720EcoRrP2-Sl265Xhor Si465Pstr P4gp150 

15 (IV) Pi-S 4 95ClarS650-720EcoRrP2-Sl265Xhor Si465Pstr P4gp16ET S 2 060Sacir P5 

wherein Pi designates the nucleotide sequence SEQ ID NO:41, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
thereto; 

wherein S 4 9 5 ciai designates the nucleotide sequence SEQ ID NO: 7, a nucleotide sequence 
20 complementary thereto, or a nucleotide sequence with a sequence identity of at least 95% 
thereto; 

wherein S 65 o.72oecori designates the nucleotide sequence SEQ ID NO: 9, a nucleotide 
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 95% thereto; 

25 wherein P 2 designates the nucleotide sequence SEQ ID NO: 43, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein Si265g P i2o designates the nucleotide sequence SEQ ID NO: 19, a nucleotide 
sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
30 least 70% thereto; 

wherein SiMsxhoi designates the nucleotide sequence SEQ ID NO: 17, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 80% 
thereto; 

wherein S 146 5Psti designates the nucleotide sequence SEQ ID NO: 23, a nucleotide sequence 
35 complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
thereto; 
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wherein P 4gp i 40 designates the nucleotide sequence SEQ ID NO: 57, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P 4 gpi 5 o designates the nucleotide sequence SEQ ID NO: 55, a nucleotide sequence 
5 complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P 4 g P ieo designates the nucleotide sequence SEQ ID NO: 53, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

10 wherein S 20 6osacii designates the nucleotide sequence SEQ ID NO: 33, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 98% thereto; and 

wherein P 5 designates the nucleotide sequence SEQ ID NO: 59, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
15 thereto. 

The design of the parent synthetic BX08 gp160 envelope cassette gene with its variant 
length genes gp150, gp140, gp120 is outlined in figure 2. 

20 The nucleotide sequence construct with the formula (I) 

( I ) Pi -S 49 5ciar Sb50.720EcoRP P2-S 1 265gp 1 20 

(visualised in figure 3) (SEQ ID NO: 73) codes for the amino acid sequence of gp120 (SEQ 
ID NO: 74). This amino acid sequence is the part of the HIV envelope that is secreted. Thus, 
it contains the immunogenic epitopes without being bound to the cell membrane. This is of 
25 particular advantage if the nucleotide sequence construct is used for production of 

recombinant antigens or for a DNA vaccine as the antibody immune response may be higher 
to secreted versus membrane bound HIV antigens. 

The nucleotide sequence construct with the formula (II) 

30 (II) P i-S 49 5ClarSe50-720EcoRrP2-Si265Xhor Si465Pstr P 4 gp140 

(visualised in figure 4) (SEQ ID NO: 67) codes for the amino acid sequence of gp140 (SEQ 
ID NO: 68). This amino acid sequence encodes the gp120 and the extracellular part of the 
gp41 protein. The amino acid sequence is secreted due to the lack of the transmembrane 
spanning region. This is of particular advantage if the nucleotide sequence construct is used 
35 for production of recombinant antigens as the immunogenic and/or antigenic epitopes in the 
extracellular part of gp41 are included and is of particular advantage for a DNA vaccine as 
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the antibody immune response may be higher to secreted gp120 versus membrane bound 
HIV antigens. 

The nucleotide sequence construct with the formula (II!) 

5 (III) Pl-S495ClarS 6 50-720EcoR|-P2-Si265Xhor Si 4 65Pstr P4gp150 

(visualised in figure 5) (SEQ ID NO: 69) codes for the amino acid sequence of gp150 (SEQ 
ID NO: 70). This amino acid sequence contains all of the envelope protein gp160 except the 
c-terminal tyrosin containing intemalisation signals in the intracellular part of gp41. The 
membrane bound surface expression of the amino acid sequence is thereby maintained and 
10 enhanced. Mimicking the organisation of the native epitope conformation may by expected, 
making this nucleotide sequence construct of particular advantage if the nucleotide 
sequence construct is used as a vaccine. 

The nucleotide sequence construct with the formula (IV) 

15 (IV) Pl-S 4 95ClarS650-720EcoRrP2-Si285Xhor S 14 65Pstr P4gp160" S 2 060Sactr P5 

(visualised in figure 6) (SEQ ID NO: 71) codes for the amino acid sequence of gp160 (SEQ 
ID NO: 72) i.e. the entire envelope. 

The nucleotide sequence construct designated Pi comprises the nucleotide sequence 
20 encoding the amino acid sequence in the first variable region (V1) and the amino acid 
sequence in the second variable region (V2). In one embodiment of the invention the first 
variable region is surrounded by EcoRV and Pstl restriction enzyme sites, and the second 
variable region is surrounded by Pstl and Clai restriction enzyme sites but as stated above, 
the choice of restriction enzyme sites can alter. 

25 

The nucleotide sequence construct designated S 65 o-72oecori comprises the nucleotide 
sequence encoding the amino acid sequence in the third variable region (V3). In one 
embodiment of the present invention S 65 o-72oecori is characterised by the restriction enzyme 
sites EcoRI and Xbal in the terminals. 

30 

The nucleotide sequence construct designated P 2 comprises the nucleotide sequence 
encoding the amino acid sequence of the fourth variable and constant region (V4 and C4). In 
one embodiment of the present invention the forth variable region is surrounded by Sad and 
Xho\ restriction enzyme sites. 

35 
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The nucleotide sequence construct designated S 12 65g P i2o comprises the nucleotide sequence 
encoding amino acid sequence of the fifth variable and constant region (V5 and C5). 
Si265g P i2o further comprises a nucleotide sequence encoding a C-tenminal stop codon. 

5 The nucleotide sequence construct designated P 4 gp 14 o comprises the nucleotide sequence 
encoding amino acid sequence of the transmembrane spanning region. P 4gp i4o further 
comprises a nucleotide sequence encoding a C-terminal stop codon prior to the 
transmembrane spanning region. 

10 The nucleotide sequence construct designated P4gpieo comprises the nucleotide sequence 
encoding amino acid sequence of the transmembrane spanning region (trans membrane 
spanning domain: TMD). In a preferred embodiment of the present invention the 
transmembrane spanning region is surrounded by Hindlll and Sacll restriction enzyme sites. 

15 The term "sequence identity" indicates the degree of identity between two amino acid 

sequences or between two nucleotide sequences calculated by the Wilbur-Lipman alignment 
method (Wilbur et al, 1 983). 

The nucleotide sequence constructs with the formula (I), (II), (III), or (IV) illustrates the 
20 flexibility in the present invention. By producing a gene with the described method enables 
the production of a plethora of antigens with various immunogenic epitopes and various 
advantages for production and vaccine purposes. To further illustrate the flexibility of the 
invention, other changes and mutations are suggested below. 

25 In order to improve the immunogenicity of the nucleotide sequence constructs of the 
invention it is suggested to change the nucleotide sequence such that one or more 
glycosylation sites are removed in the amino acid sequence. By removal of shielding 
glycosylations, epitopes are revealed to the immunesystem of the mammal rendering the 
construct more immunogenic. The increased immunogenicity can be determined by an 

30 improved virus neutralisation. Changes in the nucleotide sequence such that one or more re- 
linked glycosylation sites are removed in the amino acid sequence is well known by the 
person skilled in the art. Potential glycosylation sites are N in the amino acid sequences N-X- 
T or N-X-S (wherein X is any amino acid besides P). The glycosylation site can be removed 
by changing N to any amino acid, changing X to a P, or changing T to any amino acid It is 

35 preferred that N is changed to Q by an A to C mutation at the first nucleotide in the codon, 
and a C to G mutation at the third nucleotide in the codon. This is preferred to increase the 
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GC content in the nucleotide sequence construct. As an alternative N is changed to Q by an 
A to C mutation at the first nucleotide in the codon, and a C to A mutation at the third 
nucleotide in the codon. Preferred mutations in the synthetic BX08 envelope gene to remove 
potential N-linked glycosylation sites in V1 and/or V2 are A307C + C309A and/or A325C + 
5 C327G and/or A340C + C342A and/or A385C + C387A and/or A469C + C471A. Examples 
of such changes is illustrated in SEQ ID NOs: 47, 49, and 51. 

For historical reasons the HIVs have been divided into syncytia inducing strains and non 
syncytia inducing strains. The assay to determine whether a strain is syncytia inducing is 

10 described in Verrier et al 1997, hereby incorporated by reference. It is presently known, that 
viruses utilising the CXCR4 co-receptor are syncytia inducing strains. It is also, at the 
present, known that the binding site for the CXCR4 involves the third variable region (V3). In 
a preferred embodiment the nucleotide sequence construct is changed to create a binding 
site for the CXCR4 co-receptor. It is presently performed in the third variable regions, 

15 preferably by the mutation G865C + A866G. 

It is well established that the HIV envelope comprises immunodominant epitopes. An 
immunodominant epitope is an epitope that most antibodies from the mammal are directed 
against. The antibodies directed against these immunodominant epitopes may have little 

20 effect in elimination of the virus. It is therefore anticipated that modification of the 

immunodominant epitopes will induce antibodies directed against other parts of the envelope 
leading to a better elimination and neutralisation of the virus. By modification is understood 
any change in the nucleotide sequence encoding an immunodominant epitope in the amino 
acid sequence such that said amino acid sequence no longer contains an immunodominant 

25 epitope. Thus, modification includes removal of the immunodominant epitope and decrease 
of immunogenicity performed by mutagenesis. In a preferred embodiment of the present 
invention an immunodominant epitope in the third variable region (V3) is modified, such as 
deleted or altered. In a much preferred embodiment the nucleotides 793-897 are deleted. 
In yet another preferred embodiment of the present invention an immunodominant epitope 

30 has been removed from gp41 , such as deleted. This is performed in P 7 or P 8 by elimination 
of the nucleotides 1654-1710. 

It is anticipated that when gp120 is dissociated from gp41 in a vaccine or antigen, two 
immunodominant epitopes, one on each protein, are exposed and antibodies are directed 
35 against these in the mammal. In the infectious virus, gp120 is coiled on top of gp41 and the 
gp120/gp41 is most likely organised in a trimer, so that these immunodominant epitopes are 
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hidden and therefore less elimination of virus is observed. By removing the cleavage site 
between gp41 and gp120 a full length gp160, gp150, or gp140 can be obtained with a 
covalent binding between gp41 and gp120. Removal of the cleavage site between gp41 and 
gp120 is preferably performed by a mutation at position C1423A. An example of such a 
5 mutation is illustrated in the mutation of S 126 5Xhoi (SEQ ID NO: 17) to S 12 e5g P i6o (SEQ ID NO: 
21). 

In order to stabilise the full length gp160, gp150, and gp140 for example when the cleavage 
site between gp41 and gp120 has been removed as described above, cysteins can be 
10 inserted, preferably inside the gp41 helix creating disulphide bounds to stabilise a trimer of 
gp41s. In a preferred embodiment of the present invention the cysteins are inserted by the 
mutation 161 8:CTCCAGGC: 1625 to 1618:TGCTGCGG:1625. An example of such a change 
is illustrated in SEQ ID NO: 25. 

15 The above mentioned decrease in immunodominant epitopes combined with the increase in 
immunogenicity of the other epitopes is expected to greatly enhance the efficacy of the 
nucleotide sequence construct as a vaccine. 

During the production of the nucleotide sequence construct, it is convenient to ligate the 
20 snuts into pieces. The pieces, as described above, are characterised by their reversible 
assembly as there are no duplicate restriction enzyme sites. In a preferred embodiment one 
piece (herein designated P 3 ) contains Pi, S 4 95ciai,S 6 5o-720EcoRi, and P 2 . Another piece (herein 
designated P 8 ) contains S^esxhoi. S 14 65Psu, and P 4gp i6o- Yet another piece (herein designated 

P7) COntainS S-j265Xhot> Si465Pstl. P4gp160> S2060Sacll> anC ^ 

25 

One advantage of the present nucleotide sequence construct is the easy access to 
exchange and alterations in the content and function of the nucleotide sequence and the 
encoded amino acid sequence. In one embodiment the nucleotide sequence coding for a 
functional region or parts thereof of the amino acid sequence is repeated. The repeat could 
30 be back-to-back or a functional region or parts thereof could be repeated somewhere else in 
the sequence. Repeated could mean two (one repetition) but could also be three, six, or nine 
repeats. In a much preferred embodiment the repetition nucleotide sequence codes for 
amino acids in the third variable region. 

35 In order to improve the protective capabilities of the invention against infections with HIV, 
one embodiment of the invention relates to the combination of epitopes. The present 
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nucleotide sequence construct allows insertion of one or more new nucleotide sequences 
isolated from another group and/or subtype of HIV and/or isolated from another patient. 
Hereby a vaccine or antigen with two or more epitopes from two or more HIVs is obtained. In 
a preferred embodiment, the V3 is replaced by the new nucleotide sequence. In a much 
5 preferred embodiment, the new nucleotide sequence codes for amino acids in the third 
variable region of a different HIV isolate. 

In order to improve the efficacy of the vaccine, aiming at raising cellular immunity, a 
nucleotide sequence coding for a T-helper cell epitope is included in the nucleotide 

10 sequence construct. The nucleotide sequence coding for a T-helper cell epitope or a T- 
helper cell epitope containing amino acid sequence can be put in anywhere in the nucleotide 
sequence construct as long as it does not interact with the function of the envelope molecule. 
However, it is preferably placed in the tail of the nucleotide sequence construct or between 
the leader sequence and the envelope gene. The T-helper epitopes are preferably selected 

15 from core proteins such as P24gag or from a non-HiV pathogen such as virus, bacteria, e.g. 
BCG antigen 85. For a therapeutic vaccine an HIV helper epitope is preferred since the 
patient is already primed by the HIV infection. For a prophylactic vaccine, a T-helper cell 
epitope from a frequently occurring non HIV pathogen such as Hepatitis B, BCG, CMV, EBV 
is preferred. Also, since the synthetic BX08 envelope genes may contain T-helper cell 

20 epitopes in addition to important antibody epitopes, the synthetic BX08 vaccine genes can be 
mixed with other DNA vaccines to improve the efficacy of the other DNA vaccine. 

One aspect of the present invention relates to individualised immunotherapy, wherein the 
virus from a newly diagnosed patient is directly cloned, the envelope or subunits 

25 corresponding to snuts or pieces is produced with highly expressed codons, inserted into any 
of the nucleotide sequence constructs described above and administered to the patient as a 
vaccine. Hereby a therapeutic DNA vaccine is obtained, that will help the patient to break 
immunetolerance or induce/reinduce an appropriate immune response. In one embodiment 
the variable regions of the virus are produced with highly expressed codons and exchanged 

30 into any of the nucleotide sequence constructs described above. 

In one embodiment of the invention, the nucleotide sequence construct as described above 
satisfies at least one of the following criteria: 

a) serum extracted from a Macaque primate which has been immunised by administration of 
35 an expression vector containing the nucleotide sequence construct is capable of eliminating 
SHIV as determined by quantitative PGR and/or virus culturing. 
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b) serum extracted from a primate which has been immunised by administration of an 
expression vector containing the nucleotide sequence construct is capable of neutralising 
HIV-1 BX08 and /or other HIV-1 strains in vitro. 

c) serum, extracted from a mouse which has been immunised by administration of an 

5 expression vector containing the nucleotide sequence construct four times in intervals of 
three weeks and boosted after 15 weeks, is capable of decreasing the concentration of HIV- 
antigen in a culture of HIV, serum or PBMCs by at least 50%. An example of such procudure 
is shown in example 9. 

10 In one embodiment of the invention, the nucleotide sequence construct of the invention, is 
used in medicine. That is, it is used as a vaccine, for the production of a recombinant protein, 
such that the recombinant protein is used as a vaccine, or the nucleotide sequence construct 
or the recombinant protein is used in a diagnostic composition. 

Thus, the nucleotide sequence construct of the invention can be used for the manufacture of 
15a vaccine for the prophylactics of infection with HIV in humans. 

Intramuscular inoculation of nucleotide constructs, i.e. DNA plasmids encoding proteins have 
been shown to result in the generation of the encoded protein in situ in muscle cells and 
dendritic cells. By using cDNA plasmids encoding viral proteins, both antibody and CTL 

20 responses were generated, providing homologous and heterologous protection against 
subsequent challenge with either the homologous or cross-strain reaction, respectively. 
The standard techniques of molecular biology for preparing and purifying DNA constructs 
enable the preparation of the DNA therapeutics of this invention. While standard techniques 
of molecular biology are therefore sufficient for the production of the products of this 

25 invention, the specific constructs disclosed herein provide novel therapeutics which can 
produce cross-strain protection, a result heretofore unattainable with standard inactivated 
whole virus or subunit protein vaccines. 

The amount of expressible DNA to be introduced to a vaccine recipient will depend on the 
strength of the transcription and translation promoters used in the DNA construct, and on the 

30 immunogenicity of the expressed gene product. In general, an immunologically or 

prophylactically effective dose of about 10 jig to 300 ng is administered directly into muscle 
tissue. Subcutaneous injection, intradermal introduction, impression through the skin, 
inoculation by gene gun preferably DNA coated gold particles, and other modes of 
administration such as intraperitoneal, intravenous, peroral, topic, vaginal, rectal, intranasal 

35 or by inhalation delivery are also contemplated. It is also contemplated that booster 
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vaccinations are to be provided. It is further contemplated that booster vaccinations with 
recombinant antigens are to be provided, administered as described above. 

The DNA may be naked, that is, unassociated with any proteins, adjuvants or other agents 
5 which impact on the recipients immune system. In this case, it is desirable for the DNA to be 
in a physiologically acceptable solution, such as, but not limited to, sterile saline or sterile 
buffered saline. Alternatively, the DNA may be associated with surfactants, liposomes, such 
as lecithin liposomes or other liposomes, such as ISCOMs, known in the art, as a DNA- 
liposome mixture, (see for example WO93/24640) or the DNA may be associated with and 
10 adjuvant known in the art to boost immune responses, such as a protein or other carrier. 
Agents which assist in the cellular uptake of DNA, such as, but not limited to, calcium ions, 
detergents, viral proteins and other transfection facilitating agents may also be used to 
advantage. These agents are generally referred to as transfection facilitating agents and as 
pharmaceutical^ acceptable carriers. 

15 

Those skilled in the field of molecular biology will understand that any of a wide variety of 
expression systems may be used. A wide range of suitable mammalian cells are available 
from a wide range of sources (e.g. the American Type Culture Collection, Rockland, Dm; 
also, see e.g. Ausubel et at. 1992). The method of transformation or transfection and the 
20 choice of expression vehicle will depend on the host system selected. Transformation and 
transfection methods are described e.g. in Ausubel et at 1992; expression vehicles may be 
chosen from those provided e.g. in P.H. Pouwels et al. 1985. 

in one embodiment of the present invention the protein encoded by the nucleotide sequence 
25 construct is produced by introduction into a suitable mammalian cell to create a stably- 
transfected mammalian cell line capable of producing the recombinant protein. A number of 
vectors suitable for stable transfection of mammalian cells are available to the public e.g. in 
Cloning Vectors: A Laboratory manual (P.H. Pouwels et al. 1985); methods for constructing 
such cell lines are also publicly available, e.g. in Ausubel et al. 1992. 

30 

Standard reference works describing the general principles of recombinant DNA technology 
include Watson, J.D. et al 1987; Darnell, J.E. et al 1986; Old, R.W. et al, 1981; Maniatis.T. et 
al 1989; and Ausubel et al.1992. 
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Figure legends 

The invention is further illustrated in the following non-limiting examples and the drawing 
wherein 

5 Figure 1 provides the codon preference of highly expressed proteins in human cells. 

Figure 2 illustrates the outline of gp120, gp140, gp150, and gp160 encoding synthetic genes 
derived from the wild type sequence at the top. Variable (V) and constant (C) regions are 
shown together with the leader peptide (LP) and the transmembrane spanning domain 
10 (TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 
The approximate position of the three restriction enzyme sites dividing the full-length 
gp160 gene into the three pieces each containing only unique restriction enzyme sites are 
shown in bold. 

15 Figure 3 building of the synthetic gp120 gene. Variable (V) and constant (C) regions are 
shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

Figure 4 building of the synthetic gp140 gene. Variable (V) and constant (C) regions are 
20 shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

Figure 5 building of the synthetic gp1 50 gene. Variable (V) and constant (C) regions are 
shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
25 TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

Figure 6 building of the synthetic gp160 gene. Variable (V) and constant (C) regions are 
shown together with the leader peptide (LP) and the transmembrane spanning domain ( 
TMD). The approximate nucleotide positions of the restriction enzyme sites are shown. 

30 

Figure 7 illustrates the codons coding amino acids in general 
Figure 8 illustrates how overlapping PCR is performed. 
35 Figure 9 illustrates how PCR using conserved flanking ends is performed. 
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Figure 10 illustrates how S 126 5xhoi is produced using complementary strands (minigene- 
approach) technology. The S 1265X hot is ligated from three sets of complementary strands 
into the vector pBluescript KS + between restriction enzyme sites Xho\ and Pst\. 

5 

Figure 1 1 illustrates how S 1465 p s ti is produced. The same approach, as the approach used for 
the production of S 1265 xhoi, was used except that only two sets of complementary strands 
were used. 

10 Figure 12 illustrates the assembly of P,. The So- N -Lang and S^secorv are ligated into the Xba\ 
and Pst\ site of the S 37 5Psti containing plasmid. 

Figure 13 illustrates the assembly of P 2 . The S 900 xbai was excelled by HindlU and Sad from 
its plasmid and ligated with Swosaci (Sacl-Spel) into the S 110 s pa i plasmid that was opened 
15 at the HindUl and Spel sites. 

Figure 14 illustrates the assembly of P 3 . S 495C iat (Cla\-EcoR\) and S 650 . 72 oecori (EcoR\-Xba\) 
and P 2 (Xba\-Xho\) were ligated simultaneously into the P, plasmid opened at the C/al 
and Xho\ sites to obtain the P 3 plasmid. 

20 

Figure 15 illustrates the assembly of P 4gp160 . S 1Q90Hi ndii] (Sacl-HindUl) and S 170 oeagi (HindUl- 
Eag\) were ligated simultaneously into the S 1630 xhai plasmid opened by Sacll and Eagl. 

Figure 16 illustrates the assembly of P 5 . S 2190C .ai (CIa\-Pst\) and S 2330 Pstt (Pst\-EcoR\) were 
25 ligated into the S 2425Es plasmid opened by C/al and EcoRI. 

Figure 17 illustrates the assembly of P 8g pi 6 o. S 146 5Psti (Xba\-Pst\) and S 1265 xhoi (Pst\-Xho\) were 
ligated into the P 4gp i6o plasmid opened by Xba\ and Xho\. 

30 Figure 18 illustrates the assembly of P 8gp15 o. S 1465Psl) (Xba\-Pst\) and S 1265X hoi (Pst\-Xho\) were 
ligated into the plasmid containing P 4gp150 with the stop codon. P 4gpl50 plasmid was opened 
at the Xba\ and Xho\ sites for the ligation. 

Figure 19 illustrates the assembly of P 8gp140 . S 1465Pstt (Xba\-Pst\) and S 1265X hoi (Pst\-Xho)) were 
35 ligated into the plasmid containing P 4gp140 with a stop codon. P 4gp140 plasmid was opened 
at the Xba\ and Xho\ sites for the ligation. 
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Figure 20 illustrates the assembly of Psgp4i. Two complementary nucleotide strands 
1265gp41S and 1265gp41AS designed with overhang creating a 5' Xho\ and a 3' Pst\ 
restriction enzyme site were anealed and ligated into the piece 8 which is already opened 
5 at the Xho\ and Psft sites whereby Si 26 s is deleted. 

Figure 21 illustrates the assembly of P 7 . P 8 (Xho\-Sac\\) and Sjoeosacii (Sac\\-Cla\) were 
ligated into P 5 plasmid opened at Xho\ and C/al. 

10 Figure 22a SDS PAGE of 35 S-labelied HIV-1 BX08 envelope glycoproteins radio-immuno 

precipitated from transiently transfected 293 cells using the indicated plasmids. Cell pellet 
(membrane bound antigens) or cell supernatant (secreted antigens) were precipitated by 
a polyclonal anti-HIV-1 antibody pool. Lane 1: untransfected cells. Lane 2: supernatant 
from syn.gp120 MN transfected cells. Lane 3: cell pellet from wt.gp160 B xos transfected cells. 

15 Lane 4: cell pellet from cells co-transfected by wt.gp160 B xo8 and pRev. Lane 5: Mwt. 
marker. Lane 6: cell pellet from syn.gp160 B xo8 transfected 293 cells. Lane 7: cell pellet 
from syn.gp150 B xoa transfected 293 cells. Lane 8: supernatant from syn.gp140 B xoa 
transfected cells. Lane 9: supernatant from syn.gp120 B xos transfected cells. 

20 Figure 22b is an SDS-PAGE of 35 S-labeled HIV-1 BX08 envelope glycoproteins radio- 
immune precipitated from transiently transfected 293 cells as cell pellet (membrane 
bound) or cell supernatant (secreted antigens) by anti-HIV-1 antibody pool using the 
indicated plasmids. Lane 1: untransfected 293 cells. Lane 2: cell pellet from syn.gp160MN 
transfected 293 cells as positive control (Vinner et al 1999). Lane 3: Cell supernatant from 

25 syn.gp120MN transfected 293 cells as positive control (Vinner et al 1999). Lane 4: Cell 
supernatant from syn.gp120BX08 transfected 293 cells demonstrating a glycoprotein 
band of 120 kDa. Lane 5: Cell supernatant from syn.gp140BX08 transfected 293 cells 
demonstrating a glycoprotein band of 120 kDa. Lane 6: Mwt. marker. Lane7 at two 
different exposure times: Cell pellet from syn.gp150BX08 transfected 293 cells 

30 demonstrating a glycoprotein band of 120 kDa (lower gp30 band is not well seen in this 
exposure). Lane 8: Cell supernatant from syn.gp150BX08 transfected 293 cells showing 
no secreted proteins (all protein is membrane bound, see lane 7). 

Figure 22c show fluorescent microscopy of U87.CD4.CCR5 cells transfected with BX08 
35 gp160 genes plus pGFP. Panel A: cells transfected with empty WRG7079 vector plus 
pGFP showing no syncytia. Panel B: cells transfected with wild type BX08gp160 gene 
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plus pGFP showing some syncytia. Pane! C: celts transfected with synBX08gp160 plus 
pGFP showing extreeme degree of syncytia formation. This demonstrates expression, 
functionality, and tropism of the expressed BX08 glycoprotein with much more expressed 
functionally active gp160 from the synthetic BX08 gene. 

5 

Figure 23 shows the anti-Env-V3 BX08 antibody titers (lgG1). Panels show individual mice 
DNA immunized with syn.gp140BX08 plasmid either i.m. (left panel) or by gene gun (right 
panel), respectively. Immunization time points are indicated by arrows. 

10 Figure 24 shows a Western Blotting of (from left to right) one control strip, followed by sere 
(1:50) from 2 mice i.m. immunized with synBX08gp120, 2 mice i.m. immunized with 
synBX08gp140, 2 mice i.m. immunized with synBX08gp150, and 2 mice immunized with 
synBX08gp160, followed by 2 mice gene gun immunized with synBX08gp120, 2 mice 
gene gun immunized with synBX08gp140, 2 mice gene gun immunized with syn 

15 BX08gp150, and 2 mice gene gun immunized with synBX08gp160 respectively. Strip 5 is 
a mouse 5.1 DNA immunized i.m. with synBX08gp140 plasmid (same mouse as in figure 
23). Plasma was examined at week 18. The positing of gp160 (spiked with four coupled 
gp51), gp120 and gp41 is indicated at the right. A positive reaction to HIV glycoproteins 
futher demonstrates the mouse anti-HIV immunoglobulin reacting to HIV of a strain (IIIB) 

20 different from BX08 to illustrate cross-strain reactivity. 

Figure 25 Theoretical example of calculation of the 50% inhibitory concentration (IC 50 ) 
values. IC 50 for each mouse serum is determined by interpolation from the plots of percent 
inhibition versus the dilution of serum. 

25 

Figure 26 CTL responses were measured at week 18 to the mouse H-2D d restricted BX08 
V3 CTL epitope (IGPGGRAFYTT) for BALB/c mice (H-2D d ) i.m. immunized at week 0, 9, 
and 15 with the synthetic vaccine genes: syn.gp120 BX0 8, syn.gp140 B xo8. syn.gp150 B xoB, 
and syn.gp160 B xos, respectively, and median values of different E:T ratios for groups of 
30 mice are shown (26A). Intramuscular DNA immunization with syn.gp150 B xoe induced a 
higher CTL reponse when injected i.m. in high amounts versus gene gun inoculation of 
skin (26B). 

Figure 27 Summary of western immuno blotting assay of mice sera (1 :40) collected at week 
35 0, 9, and 18 from mice genetically immunized with syn.gp120 BX0 8, syn.gp140 BX0 8, 
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syn.gp15(W syn.gpl 60 BX08 , wt.gp160 BX0B , and wt.gp160 BX08 plus pRev, respectively. 
Percent responders in groups of 17-25 mice against gp120 and gp41 are shown. 

Figure 28 IgG anti-rgp120 (IIIB) antibody titers of individual mice inoculated at week 0, 9 t 15 
5 (28A), or gene gun immunized at week 3, 6, 9, and 15 (28B) with the syn.gpl 50 BX o8 DNA 
vaccine. 

Figure 29 IgG antibody titers to HIV-1 rgp120ni B . Median titers are shown from groups of 
mice i.m. inoculated at week 0, 9, and 15 (29A). or gene gun immunized at week 0, 3, 6, 
10 9, and 15 (29B) with the synthetic genes syn.gpl 20 B xoe, syn.gpl 40 BX0 8, syn.gpl 50 BX08 , 
and syn.gp160 B xoe, respectively. 
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Examples 

Example 1: Designing the nucleotide sequence construct 

Initially the overall layout of the nucleotide sequence construct is decided. The overall layout 
comprises the various derivatives the gene will be expressed as. For BX08 these include, but 
5 are not restricted to gp160, gp150, gp140, gp120, and gp41 . 

Next, the vehicle of expression (plasmid or virus) is to be determined: Preparation for a 
suitable vector determines both need for leader sequence, terminal restriction enzyme sites 
and whether or not an N- or C-terminal protein tag is to be considered (Poly-his, Myc- 
antibody-epitop, etc.). For BX08 a plasmid expression vehicle was chosen. All native wild 

10 type HIV codons are systematically exchanged with the codons most frequently represented 
in a pool of highly expressed human genes (figure 1). By this exchange the amino acid 
sequence is conserved while the nucleotide sequence is dramatically altered. Thus, gene 
structures like overlapping reading frames (e.g. vpu, rev, and tat) or secondary structures 
(e.g. RRE) are most likely destroyed whereas protein cleavage sites, and glycosylation sites 

15 are maintained. The 100% amino acid identity between wtBX08 and synthetic BX08 in the 
present examples should be calculated after the initial Ala-Ser amino acid sequence, as that 
sequence is a part of the 6 amino acid sequence long Nhe\ restriction enzyme site. 

Depending on the restriction enzyme sites located in the expression vector it is decided 
20 which restriction enzyme sites can be present (tolerated) throughout the finished gene 

construct. The terminal restriction enzyme sites of the synthetic gene must remain unique to 
enable cloning into the vector chosen for expression. General requirements for restriction 
enzyme sites of choice: Preferably creating cohesive ends facilitating ligation, creating no 
compatible ends with adjacent restriction enzyme sites (e.g. BamHI/Bsr/II), and being efficient 
25 cutters. For BX08 the restriction enzyme sites accepted were the ones present in the 
polylinker of the pBluescript cloning vectors (Eag\ t Mlu\, EcoRV, Ps/I, C/al, EcoRI, Xba\, 
Sad, Spel, Xho\, HindlU, Sacll A/of I, BamHI, Smal, Sa/I, Oral Kpn\ with the exception of 
Bg/ll and Nhe\). This was decided to satisfy the original cloning strategy using individual 
cloning of snuts in pBluescript with restriction enzyme cleaved (trimmed) ends after PCR 
30 amplification, which is not necessary when blunt-end cloning and assembling of 

complementary oligonucleotides are employed. All locations at which the selected restriction 
enzyme sites can be introduced by silent mutations (keeping 100% loyal to the amino acid 
sequence) are identified using the SILMUT software or equivalent. 
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From these possible restriction enzyme sites, a selection of restriction enzyme sites are 
introduced by silent nucleotide substitutions around functional regions of choice of the 
corresponding gene (e.g. RRE) or gene products (e.g. variable region 1 (V1), V2, V3, CD4 
binding area, transmembrane domain, and regions of immunological significans, etc.). 
5 Restriction enzyme sites are located at terminal positions of subcloned snuts (building 
entities) but additional restriction enzyme sites may be present within subunits. For BX08 the 
construct was initially to be cloned in the WRG7079 vector containing a tPA-leader 
sequence. Cloning sites were 5'-Nhe\ Ba/nHI-3\ The entire humanised BX08 sequence 
was divided into thirds: 5'-Nhe\ -+ Xho\ -> Sacll Ba/nHI-3'. These sites were chosen in 
10 this particular order because it resembles the polylinker of pBluescript (KS ) enabling 
successive ligations of the assembled thirds in this cloning vector. Within these thirds 
restriction enzyme sites were kept unique. Next, restriction enzyme sites were placed to flank 
the functional regions chosen as follows: 

A. (5-V1): EcoRV-235: Between C1 and V1 . Alternatives: 3xHind\\\ (already excluded 
1 5 because exclusive use at position 1 890) or EcoRV. 

B. (V1-3*): Only alternative Pst\ 375. 

C. (5'-V2): as B. 

D. (V2-3'): Alternatives: Spel, C/al 495. C/al chosen because it is closer to V2. 

E. EcoRI 650 placed because next possible site was too far away. 

20 F, (5'-V3): Bg/ll 720 was the alternative closest to the V3 region and further more unique. 

G. (V3-3'): Alternatives Xho\ (excluded) and Xba\ 900 located very close to the V3 loop. 

H. (5'-V4) Sad 990: alternatively EcoRI or BamHI (both excluded) 

I. (V4-3'): Alternatives Spel 1110, Kpn\ 1145, P$t\ 1135. Pst\ already used, Spel chosen 
because of distance to previous site (Sad 990). 

25 J. (5-V5): Xho\ initially determined. 

K. (Fusion peptide-3') Pst\ 1465 was the closest alternative to Xho\ 1265. 

L. (5-lmmunodominant region): Xba\ 1630 chosen among EcoRV (blunt end), Pst\ and Xho\ 

(both already used). 

M. (Immunodominant region-3'): Eag\ 1700 perfect location. 
30 N. (C34 and C43 -3' (Chan, Fass, et al. 1997), and 5'-trans membrane domain): Sacll. No 
alternatives. 

O. (Trans membrane domain -3'): Sacll 2060 already present. 
P. C/al 2190 perfect position in relation to previous RE-site. 
Q: Pstl perfect position in relation to previous RE-site. 
35 R. EcoRI 2400 introduced to facilitate later substitution of terminal snut. 
S. SamHI 2454 determined by the WRG7079 vector. 
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Remove undesired restriction enzyme sites by nucleotide substitutions (keeping loyal to the 
amino acid sequence). Nucleotide substitution should preferably create codon frequently 
used in highly expressed human genes (figure 1). If that is not possible, the codons should 
5 be the selected from the regular codons (figure 7). The substitutions made to the second 
nucleotide sequence to obtain desired restriction enzyme sites are shown in Table 2. 



Table 2 lists silent nucleotide substitutions in the humanised BX08 envelope 
1 0 sequence. Substitutions were made to create or delete restriction enzyme sites. 



Position: 


substitution 


Remarks: 


138 


c g 


creates Mtu I site on pos. 134-139 


240 


c-M 


creates EcoRV site on pos. 238-243 


501 


c a 


creates Cla I site on pos. 501-506 


502 


a -> t 


do 


503 


g c 


do 


504 


c->g 


do 


657 


c -> a 


creates EcoRI site on pos. 656-661 


660 


c->t 


do 


724 


c -> a 


creates Bgl II site on pos. 724-729 


726 


c g 


do 


727 


a->t 


do 


728 


g -> c 


do 


729 


c->t 


do 


840 


c -> t 


Eagl site is eliminated 


904 


a -» t 


creates Xba I site on pos. 904-909 


905 


g->c 


do 


906 


C-M 


do 


907 


c -> a 


do 


909 


c -> a 


do 


994 


a-*t 


creates Sac I site on pos. 990-995 


995 


g ~> c 


do 


1116 


c->t 


creates Spel site on pos. 1114-1119 


1119 


c-> t 


do 


1273 


a -> t 


creates Xhol site on pos. 1272-1277 


1274 


g -> c 


do 


1275 


c g 


do 


1293 


c->t 


Bgl II site is eliminated 


1443 


c->t 


BstXI site is eliminated 


1452 


g -> c 


do 


1467 


c->t 


Pstl site on pos. 1466-1471 


1470 


c — ► a 


do 


1590 


g c 


Pstl site on pos. 1588-1593 is eliminated 


1620 


g c 


Pstl site on pos. 1618-1623 is eliminated 


1638 


c->t 


creates Xbat site on pos. 1638-1643 


1641 


g -► a 


do 


1653 


g -> c 


Pstl site is eliminated 


1687 


a -> t 


Pstl site is eliminated 


1688 


g -> c 


Pstl site is eliminated 


1710 


c -> g 


creates Eagl site on pos. 1709-1714 


1758 


c->t 


Bgl II site is eliminated 


1875 


g -+ c 


Pstl site is eliminated 


1893 


c -> a 


Hind III on pos. 1893-1898 


1897 


c -> t 


do 


1944 


c -> t 


Bgl II site is eliminated 
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Position: 



substitution 



Remarks: 



2199 
2202 
2203 
2253 
2292 
2320 
2321 
2322 
2325 
2430 
2433 



c->t 

c-»t 

c->t 

c -> g 

g->c 

a->t 

a->t 



Cla I site on pos. 2198-2203 



do 
do 



Sacll site is eliminated 

Pstl site is eliminated 

Pstl site on pos. 2321-2326 is eliminated 



c -» a 
c -> a 
c->t 



do 
do 
do 



creates EcoRI site on pos. 2429-2434 
do 



Example 2: synthesis of oligos 

In order to clone the individual snuts, nucleotide strands were synthesised or purchased. In 
total 28 synthetic nucleotide strands were synthesised. Nucleotide strands were synthesised 
5 by standard 0.2 \imo\ p-cyanoethyl-phosphoramidite chemistry on an Applied Biosystems 
DNA synthesiser model 392, employing 2000 A CPG columns (Cruachem, Glasgow, 
Scotland), acetonitrile containing less than 0.001% water (Labscan, Dublin, Ireland) and 
standard DNA-synthesis chemicals from Cruachem, including phosphoramidites at 0.1 M 
and Tetrahydrofuran/N-methylimidazole as cap B solution. The nucleotide strands O-N-C 

10 and 1 19MS-RC (for cloning of snut O-N-Lang), 650-E-BG and 720-XBAC-31 (for cloning of 
snut 650-720-EcoRI), 2425esup and 2425ESdo (for cloning of snut 2425-E-S) were 
synthesised with 5' end "trityl on" and purified on "Oligonucleotide Purification Cartridges" 
(Perkin Elmer, CA, USA) as described by the manufacturer. Other nucleotide strands (235- 
EC05, 375-pst1.seq, 495-Clalseq,900-Xbal, 990-sac1, 1110-SPE, 1630-Xba.seq, 1700- 

15 Eag.seq, 17-Eag.seq, 1890-Hind.MPD, 2060-sac, 2190-cla, 2330-pst) were synthesised with 
5' end "trityl off and purified by standard ethanol precipitation. Oligoes 1265-1 UP, 1265- 
1DO, 1265-2UP, 1265-2DO, 1265-3UP, 1265-3DO, 1465-1UP, 1465-1DO, 1465-2UP, 1465- 
2DO were purchased from Pharmacia. 

Example 3: Cloning of snuts 

20 The nucleotide sequence construct was designed in 17 DNA small pieces called snuts 

(Table 3) encompassing important structures like variable and constant regions each flanked 
with restriction enzyme (RE) sites to facilitate cassette exchange within each third of the 
gene: Nhe\-Xho\, Xhol-Sacll, Sacll-BamHI. 

25 Each snut was cloned individually in a commercial vector (pBluescriptKS or pMOSblue) and 
kept as individual DNA plasmids, named after the snut which gives the nucleotide position of 
the RE in the BX08. 
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Table 3 list the snuts by their name and cloning vector. 



Name Cloning vector: 



So-N-Lang 


pMOSblue 


S235E00RV 


pMOSblue 


S375PSH 


pBluescriptSK 


S495Clal 


pMOSblue 


Sa5o.720EcoRI 


pMOSblue 


SgoOXbal 


pMOSblue 


Ss&oSacI 


pMOSblue 


SmoSpel 


pMOSblue 


Si265Xhol 


pBluescriptSK 


Si465Pstl 


pBluescriptSK 


Si630Xbal 


pBluescriptSK 


Si700Eagt 


pBluescriptSK 


Si890Hindlll 


pBluescriptSK 


S2060Sacll 


pMOSblue 


S21 90Clal 


pMOSblue 


S2330Pstl 


pMOSblue 


S2425ES 


pBluescriptSK 



Three principally different methods were used to obtain the dsDNA corresponding to each of 
5 the 17 snuts needed to build the synthetic BX08 genes. 



1) "Overlapping" PCR: is based on the use of two ssDNA template nucleotide strands 
(forward and reverse) that complement each other in their 3'-end (figure 8). During the first 
PCR cycle, both templates annealed to each other at the 3-ends allowing the full-length 
10 polymerisation of each complementary strand during the elongation step. The newly 

polymerized dsDNA strand are then amplified during the following cycles using an adequate 
forward and reverse primers set (figure 8). 



Snut O-N-LANG: (So-N-iang) two ng of the forward template nucleotide strands O-N-C and 2 
15 ng of the reward template-nucleotide strand 1 1 9MS-RC were mixed together with SOpmoles 
of the forward primer O-N-LANG-5 (S'-CTAGCTA-GCGCGGCCGACCGCCT -3') and 
SOpmoles of the reverse primer O-N-LANG-3 (5-CTCGATATCCTCGTGCATCTGCTC -3') in 
a 100^1 PCR reaction volume containing 0.2mM dNTP's, 1x ExpandHF buffer with MgCI 2 
(1.5mM) and 2.6 units of enzyme mix (Expand™ High fidelity PCR system from Boehringer 
20 Mannheim). The PCR was performed with the PE Amp 9600 thermocycler (Perkin Elmer) 
using the following cycle conditions: initial denaturation at 94°C for 30 sec, followed by 30 
cycles of 94°C for 15 sec, 65°C for 30 sec, 72°C for 45 sec, with a final elongation at 72°C 
for 5 min., and cooling to 4°C. 
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Snut 650-720-EcoRI: (S 6 5o-72oecori) PCR amplification was performed as described for snut 
O-N-LANG. One \ig of the forward ssDNA template-oligonucleotide 650-E-BG and 1 jig of 
the reverse ssDNA template-oligonucleotide 720-XBAC were mixed with 40pmoles of the 
forward primer 650-E-5 (5'-CCGGAATT-CGCCCCGTGGTGAGCA-3') and 40 pmoles of the 
5 reverse primer 720-X-3 (5'-CTGCTCTAGAGATGTTGCAGTGGGCCT-3*). 

2) "Normal" PCR amplification: Eleven nucleotide strands: 235-EC05, 375-pst1, 900-xba1 t 
990-sad, 1110-SPE, 1630-XBA, 1700-EAG, 1890-HIN, 2060-sac, 2190-cla, and 2330-pst, 
were designed with common 5* and 3' flanking sequences which allowed PCR amplification 

10 with the same primer set (Forward primer : BX08-5 (5-AGCGGATAACAATTTCACACAGGA- 
3') and revers primer : BX08-3 (5'-CGCCAGGGTTTTCCCAGTCACGAC-3') (Figure 9). The 
495-Cla1 oligonucleotide was designed without a common flanking sequence and was 
therefore amplified with a specific set of primers 495-5N/495-3N ( 5-GAATCGAT- 
CATCACCCAG-3' and 5'-GACGAATTCCGTGGGTGCACT-3'). Each oligonucleotide was 

15 resuspended in 1ml of water and kept as a stock solution (approximatively 0.2 mM). PCR 
amplification was performed with the Expand™ High Fidelity PCR System from Boehringer 
Mannheim (Cat. No. 1759078). Four concentrations of template nucleotide strand were 
systematically used: undiluted stock solution, stock solution 10*\ stock solution 10' 2 , stock 
solution 10* 3 . One to 5 ^il of synthetic ssDNA template was amplified using the following 

20 conditions: BX08-5 (0.5*iM), BX08-3 (0.S\M), 4 dNTP's (0.2mM) t 1x ExpandHF buffer with 
MgCI 2 (1.5mM) and 2.6 units of enzyme mix. The PCR was performed using the PE Amp 
9600 thermocycler (Perkin Elmer) using the following cycle conditions: initial denaturation at 
94°C for 15 sec, followed by 30 cycles of 94°C for 15 sec, 65°C for 30 sec. and 72°C for 45 
sec, with a final elongation at 72°C for 7 min., and cooling to 4°C. 

25 

3) Minigene approach : This method was used to synthesise S 12 65Xhoi, Suesxbai and S 242 5es. 
Snut 2425-E-S (S 2425 es): 100 picomoles of each oligonucleotide 2425ES-up (35-mer ; 5'- 
AATTCGCCAGGGCTTCGAGCGCGCCCTGCTGTAAG-3') and 2425ES-do (35-mer ; 
GATCCTTACAGCAGGGCGCGCTCGAAGC-CCTGGCG-3') were mixed together in a 100 nl 

30 final volume of annealing buffer containing NaCI 25mM , Tris 10mM and 1mM EDTA. After 
denaturation at 94°C for 15 min., the mixed oligonucleotides were allowed to anneal at 65°C 
during 15min.. The annealing temparature was allowed to slowly decrease from the 65°C to 
room temperature (22°C) during overnight incubation. The resulting double-strand dsDNA 
fragments harbored EcoRI- and BamHI-restriction sites overhangs that allowed direct cloning 

35 in pBluescript KS(+) vector using standard cloning techniques (Maniatis 1996). 
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Snut 1265-XhoI (S 126 5Xhoi): This snut was built according to the strategy depicted in figure 10. 
Three minigenes were constructed following the same method described for snut 2425-E-S. 
These minigenes are named 1265-1, 1265-2 and 1265-3. The minigene 1265-1 results from 
the annealing of the oligonucleotides 1265-1 up (68-mer ; 5*-tcg agc agc GGC aag gag att 

5 TTC CGC CCC GGC GGC GGC GAC ATGC GCG ACA ACT GGC GCA GCG AGC T-3') and 1 265-1 do (68- 
mer ; 5-GTA CAG CTC GCT GCG CCA GTT GTC GCG CAT GTC GCC GCC GCC GGG GCG G AAA ATC 

TCC TTG CCG ctg C-3'). 1265-2 results from the annealing of 1265-2up (61-mer ; 5'-gta caa 

GTA CAA GGT GGT GAA GAT CGA GCC CCT GGG CAT CGC CCC CAC CAA GGC CAA GCG C-3') and 
1265-2d0 (63-mer ; 5'-CAC GCG GCG CTT GGC CTT GGT GGG GGC GAT GCC CAG GGG CTC GAT 

10 ctt cac CAC CTT gta CTT-3'). Finally, 1265-3 results from the annealing of 1265-3up (69-mer 

; 5-CGC GTG GTG CAG CGC GAG AAG CGC GCC GTG GGC ATC GGC GCT ATG TTC CTC GGC TTC CTG 

ggc gct gca-3 1 ) and 1265-3do (59-mer ; 5'-gcg ccc agg aag ccg agg aac ata gcg ccg 
atg ccc acg gcg cgc ttc tcg cgc tgc AC-3'). Each minigene were designed in order to 
present single strand overhangs at their 5' and 3 - ends that allow easy ligation and Xhol-Pstl 
15 direct cloning into pBlueScript KS+ vector. 

Snut 1465-Pstl (Si 465 psti): Two minigenes were constructed following the same methode 
described for snut 2425-E-S. These minigenes are named 1465-1 and 1465-2. The minigene 
1465-1 was obtained after annealing of 1465-1up (90-mer : 5 -ggc agc acc atg ggc gcc 

20 GCC AGC CTG ACC CTG ACC GTG CAG GCC CGC CAG CTG CTG AGC GGC ATC GTG CAG CAG CAG 
AAC AAC CTG CTG-3') and 1465-1 dO (98-mer : 5-CGC GCA GCA GGT TGTTCT GCT GCT GCA CGA 
TGC CGC TCA GCA GCT GGC GGG CCT GCA CGG TCA GGG TCA GGC TGG CGG CGC CCA TGG TGC 

tgc ctg CA-3'), whereas minigene 1465-2 results from the annealing of 1465-2up (78-mer ; 

5-CGC GCC ATC GAG GCC CAG CAG CAC CTG CTC CAG CTGA CCG TGT GGG GCA TCA AGC AGC TCC 
25 AGG CCC GCG TGC TGG CT-3') and 1465-2do (78-mer ; 5'-CTA GAG CCA GCA CGC GGG CCT GGA 
GCT GCT TGA TGC CCC ACA CGG TCA GCT GGA GCA GGT GCT GCT GGG CCT CGA TGG-3'. Each 

minigene were designed in order to present single strand overhangs at their 5' and 3 - ends 
that allow easy ligation and Pstl-Xbal direct cloning into pBlueScript KS+ vector using 
standard cloning techniques (Maniatis) (see figure 11). 

30 Example 4: assembly of snuts to pieces. 

The snut genes were then assembled into pieces (Table 4) so that unique restriction enzyme 
sites or mutagenesis can be used within each of these. This strategy will require fewer 
assemblings for optimal use of the cassette system. The following piece clones were made 
and kept individually for construction of the synBX08 gp160 gene (Figure 6): 
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Table 4 lists pieces by their name and their snut composition. 



Piece snut composition vector 

name 

Pi S 0 . N . LANG -S235EcoRvS 3 75Psti pBluescriptSK 

P2 SgooxbarSwosacrSmospei pMOSblue 

p 3 p i-S495ciarS 6 5o-720EcoRrP2 pBluescriptSK 

P4gpieo S ie 3oxbarSi7ooEa9rSi890Hinin pBluescriptSK 

Ps S2i9ociarS 2 330PstrS 2 425Es pBluescriptKS 

P7 Pegpi6o-S 2 oeosacirP5 pBluescriptKS 

Peapieo S^sxhorS^espstrP^piso pBluescriptKS 



Piece 1: The building strategy is shown in figure 12. 

Preparation of the insert DNA: Five to 15^g of each plasmid 0-N-LANG-CI7 and 235-EcoRV- 
5 cISN, respectively, were double-digested by Xbal/EcoRV, and Pstl/EcoRV, according to 
classical RE digestion procedure (Maniatis). The RE digestion products, were agarose gel 
purified according classical method (Maniatis). All RE digests were loaded on a 3% Nusieve 
3:1 (FMC), TBE 0.5X agarose gel and submitted to electrophoresis (7 Volts/mm during 2- 
3hours) until optimal fragment separation. The agarose-band containing the DNA fragments 
10 that correspond to the snut's sequence sizes (243-bp for O-N-LANG and 143-bp for 235- 
EcoRV) were excised from the gel. The DNA was extracted from agarose by centrifugation 
20min at 5000g using a spin-X column (Costar cat#8160). Preparation of the vector: The 
snut 375-Pst1 klonl was used as plasmid vector. Five ng were digested with Xbal and Pst1. 
Removal of the polylinker Xbal/Pstl fragment was performed by classical agarose gel 
15 purification, using a 0.9% Seakem-GTG agarose, TBE 0.5X gel. The linearised plasmid DNA 
was extracted from the agarose by filtration through spin-X column. All purified DNA 
fragments were quantified by spectrophotometry. Ligation: All three DNA fragments O-N- 
LANG (Xbal/EcoRV), 235-EcoRV (Pstl/EcoRV) and 375-Pstl(Xbal/Pstl), were ligated 
together by classical ligation procedure, using an equimolar (vector:insert1:insert2) ratio of 
20 1:1:1. Thus for, 200 ng (0.1 pmole) of Xbal/Pstl-linearised 375-Pstl-ch were mixed with 16 
ng of O-N-LANG (Xbal/EcoRV) and 10 ng of 235-EcoRV (Pstl/EcoRV) in a final reaction 
volume of 20^1 of 1X ligation buffer containing 10U of T4 DNA ligase (Biolabs, cat#202S). 
The ligation was allowed overnight at 16°C. Transformation: Competent XL1-Blue bacteria 
(Stratagene cat#200130, transformation efficiency > 5»10 5 col/^g) were transformed by 
25 classical heat-chock procedure : 1/1 0th of the pre-chilled ligation reaction was mixed with 
50^1 of competent bacteria. The mixture was allowed to stand in ice during 30 min. Bacteria 
were heat-shocked at 42°C during 45 sec. and then left 2 min. on ice before being 
resuspended in 450 jil of SOC medium. Transformed bacteria were incubated 1 hour at 37°C 
under shaking (250rpm) and plated on LB-ampicilin agar plates. The recombinant clones 
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were allowed to grow 16 hours at 37°C. Colons/ screening: 1 0 to 50 recombinant colonies 
were screened by direct PCR screening according to the protocole described into the 
pMOSBlue blunt-ended cloning kit booklet (RPN 5110, Amersham). Each colony was picked 
and resuspended in 50^1 of water. DNA was freed by a boilling procedure (100°C, 5 
5 min).Ten |il of bacterial tysate were mixed to 1 nl of a 10mM solution of premixed 4 dNTP's , 
1 til of M13reverse primer (Spmoles/^il, 5-CAGGAAACAGCTATGAC-3'), 1 ^l of T7 primer 
(5pmoles/^l, 5'-TAATACGACTCACTATAGGG-3'), 5jil of 10x Expand HF buffer 2 
(Boehringer Mannheim, cat#1 759078), 0.5\x\ of Enzyme mix (Boehringer Mannheim, 5U/nl) 
in a final volume of 50^il. DNA amplification was performed with a thermo-cycler PE9600 

10 (Perkin-Elmer) using the following cycling parameters: 94°C, 2min, 35 cycles(94°C, 30sec; 
50°C, 15sec; 72°C,30sec); 72°C, 5min; 4°C hold. Five nl of the PCR products were analysed 
after electrophoresis on a 0.9% SeakemGTG , 0.5xTBE agarose gel. Nucleotide sequence 
confirmation: ds-DNA was purified from minicultures of the selected clones with the JETstar 
mini plasmid purification system ( Genomed Inc.). Sequencing was performed using 

15 M13reverse and 17 primers and with the Big DyeTM Terminator Cycle Sequencing Ready 
reaction kit (Perkin-Elmer, Norwalk, Connecticut, P/N4303152) and the ABI-377 automated 
DNA sequenator (Applied Biosystems, Perkin-Elmer,Norwalk, Connecticut). Data were 
processed with the Sequence Navigator and Autoassembler softwares (Applied Biosystems, 
Perkin-Elmer,Norwalk, Connecticut). 

20 

Piece 2: The strategy for building that piece is depicted in figure 13. RE digestion, DNA 
fragments purification, ligation as well as direct PCR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 

- The linearised plasmid 1 1 10-Spel-cl24M1 was used as vector after being digested by 
25 Hindlll and Spet, and agarose gel purified. 

-A 166-bp Hindlll/Sacl, obtained from snut 900-Xbal-cl15, as well as a 130-bp Sacl/Spel 
fragment, obtained from snut 990-Sacl-cl14, were agarose gel purified. 

- Equimolar amount (0.1 pmoles) of the three DNA fragments described above were ligated 
in an one step ligation. 

30 - 100jil of competent SCS1 10 bacteria (Stratagene cat# 200247) were transformed with 
1/10th of the ligation products according to the manufactor instruction. 

- Direct colony PCR screening was performed using T7 primer and pMOS-R (5- 
GTTGTAAAACGACGGCCAG-S'). 
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Piece 3: The strategy for building that piece is depicted in figure 14. RE digestion, DNA 
fragments purification, ligation as well as direct PGR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 

- The plasmid piece1-c!33 was linearised by Clal and Xhol, in order to be used as vector, 
5 and agarose gel purified. 

- A 161-bp Clal/EcoRI fragment, obtained from 495-Cla!-cl135M1 as well as a 254-bp 
EcoRI/Xbal fragment, obtained from 650-720-EcoRI-cI39, and a 374-bp Xbal/Xhol fragment, 
obtained from piece2-cI4, were agarose gel purified. 

- Equimolar amount (0.1 pmole) of each of these 4 DNA fragments were mixed and ligated 
10 together. 

- 50^i of competent XLIBlue bacteria were transformed with 1/10th of the ligation products 
according to the protocole described for piece! 

- Direct colony PGR screening was performed using M13Reverse and T7 primers. 

15 Piece 4 gp160: The strategy for building that piece is depicted in figure 1 5. RE digestion, 
DNA fragments purification, ligation as well as direct PCR screening of recombinant colonies 
were performed according the same procedures described above for piecel, except the 
following: 

- The plasmid 1630-Xbal-cl2 was linearised by Sacll/ Eagl digestion and agarose gel 
20 purified, in order to be used as vector. 

- A 190-bp Eagl/Hindlll fragment, obtained from snut 1700-Eagl-cl4, as well as a 177-bp 
Sacll/Hindlll fragment, obtained from snut 1890-Hindlll-cl8, were agarose gel purified. 

- Equimolar amount (0.1 pmoles) of the three DNA fragments described above were ligated 
in an one step ligation. 

25 - 50^ of competent XL1 Blue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel 

- Direct colony PCR screening was performed using M13Reverse and T7 primers. 

Piece 4-gp150: PCR-based site-directed mutagenesis was performed on double-stranded 
30 plasmid-DNA from piece4-c!4 according an adaptation of the ExSite™ PCR-Based Site- 
Directed Mutagenesis Kit procedure (Stratagene cat#200502)(Weiner, M.P., Costa, G.L, 
Schoettlin, W., Cline, J., Marthur, E., and Bauer, J.C. (1994) Gene 151:119-123). The 
mutations introduced are shown in bold letters in the primer sequences below. PCR 
amplification was performed with the Expand™ High Fidelity PCR System (Boehringer 
35 Mannheim, cat#1759078). Briefly, 1.5 ng, 0.5 ^g or 0.1 fig of circular dsDNA was mixed with 
1.5 pmoles of P4M2S (5'-TCTGGAAGCTCAGGGGGCTGCATCCCTGGC-3') and 1.5 
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pmoles of P4M2AS (5-CCCGCCTGCCCGTGTGACGGATCCAGCTCC-3') in a final volume 
of 50^1 containing 4 dNTPs (250nM each), 1x Expand HF buffer 2 (Boehringer Mannheim, 
cat#1 759078), 0J5nl of Enzyme mix (Boehringer Mannheim, 5U/|il). The PGR was 
performed with a PE9600 thermo-cycler (Perkin-Elmer Corporation) under the following 
5 cycling parameters : 94°C, 2min ; 15 cycles (94 °C, 45 sec ; 68 °C, 4 min); 72°C, 7min and 4 
°C, hold. PGR products were phenolxhloroform extracted and precipitated (Maniatis). 
Plasmid template was removed from PCR products by Dpnl treatment (Bio!abs)(Nelson, M., 
and McClelland, M., 1992) followed by ethanol-precipitation. Amplicons were resuspended in 
50 pJ stent water, and phosphorylated according the following procedure: 7.5 y\ of amplicons 

10 were mixed with 0,5jal of 100mM DTT, 1 \x\ of 10x pk buffer and 1 |il of pk mix enzyme 

(pMOSBlue blunt-ended cloning kit, Amersham , cat#RPN 5110). DNA kinasing was allowed 
5min at 22°C. After heat-inactivation (10min, 75°C) of the pk enzyme, 1^1 of ligase (4units, 
Amersham , cat#RPN 5110) was directly added to the pk reaction. The ligation was allowed 
overnight at 22°C. 50^1 of competent XLIBlue bacteria were transformed with 1/10th of the 

15 ligation reaction according to the classical protocol (Maniatis). Insertion of mutations was 
checked by sequencing. 

Piece 4-gp140: PCR-based site-directed mutagenesis was performed on piece 4-cl4, 
according to the procedure described for piece4-gp150 except that the primers P4M1AS (5- 
20 TGTGTGACTGATTGAGGATCCCCAACTGGC-3') and P4S (5'- 
AGCTTGCCCACTTGTCCAGCTGGAGCAGGT-3') were used. 

Snut 1265-Xhol-gp120 : PCR-based site-directed mutagenesis was performed on plasmid 
1265-Xhol-cl2M1 according to the procedure described for piece4-gp150 except that the 
25 primers 1265MAS (5 , -CTTCTCGCGCTGCACCACGCGGCGCTTGGC-3 , ) and 1265M2S (5- 
CGCGCCTAGGGCATCGGCGCTATGTTCCTC-3') were used. 

Snut 1265-Xhol-gp160/uncleaved : PCR-based site-directed mutagenesis was performed 
on plasmid 1265-Xhol-cl2M1 according to the procedure described for piece4-gp150 except 
30 that the primers 1265MAS (5'-CTTCTCGCGCTGCACCACGCGGCGCTTGGC-3') and 
1265M2S (S'-AGCGCCGTGGGCATCGGCGCTATGTTCCTC-S') were used. 

Snut 1465-Pstl-CCG : PCR-based site-directed mutagenesis was performed on plasmid 
1465-Pstl-cl25 according to the procedure described for piece4-gp150 except that the 
35 primers 1465MAS (5'-CTGCTTGATGCCCCACACGGTCAGCTG-3') and 1465MS (5*- 
TGCTGCGGCCGCGTGCTGGCTCTAGA-3') were used. 
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Piece 5 : The strategy for building that piece is depicted in figure 16. RE digestion, DNA 
fragments purification, ligation as well as direct PCR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 
5 - The plasmid 2425-ES-cl2 was linearised by Clal/EcoRI digestion and agarose gel purified, 
in order to be used as vector. 

- A 129-bp Pstl/Clal fragment, obtained 2190-Clal-cl6M15, as well as a 114-bp Pstl/Ecorl 
fragment, obtained from 2330-Pstl-cl8, were agarose gel purified. 

- Equimolar amount (0.1 pmoles) of the three DNA fragments described above were ligated 
10 in an one step ligation. 

- 50nl of competent XL1 Blue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel. 

- Direct colony PCR screening was performed using T3 (5'-ATTAACCCTCACTAAAG-3') and 
T7 primers. 

15 

piece 8 : The strategy for building that piece is depicted in figure 17. RE digestion, DNA 
fragments purification, ligation as well as direct PCR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following; 

- The plasmid piece4-cl4 was linearised by Xbal/Xhol and agarose gel purified, in order to be 
20 used as vector. 

- A 200-bp Xhol/Pstl fragment, obtained from 1265-Xhol-cl2M1 as well as a 178-bp Pstl/Xbal 
fragment, obtained from 1465-Pstl-cl25 were agarose gel purified. 

- Equimolar amount (0.1 pmole) of these 3 DNA fragments were mixed and ligated together. 

- 50pJ of competent XL1 Blue bacteria were transformed with 1/1 0th of the ligation products 
25 according to the protocole described for piecel. 

- Direct colony PCR screening was performed using T3 and T7 primers. 

piece 8-gp150 : The strategy for building that piece was identical to that of piece 8, except 
that piece4-cl4M3 was used as vector (figure 18). 

30 

piece8-gp150/ uncleaved : The strategy for building that piece is identical to that of piece 8, 
except that piece4 gp160-cl4M3 is used as vector and a 200-bp Xhol/Pstl fragment, obtained 
from snut 1265-Xhol-gp160/uncleaved as well as a 178-bp Pstl/Xbal fragment, obtained from 
snut 1465-Pstl-CCG are used like inserts. 

35 
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piece 8-gp140 : The strategy for building that piece was identical to that of piece 8, except 
that piece4-cl4M5 was used as vector (figure19). 

piece8-gp140/ uncleaved : The strategy for building that piece is identical to that of piece 8, 
5 except that piece4-cl4M5 is used as vector and a 200-bp Xhol/Pstl fragment, obtained from 
snut 1265-Xhol-gp160/uncleaved as well as a 178-bp Pstl/Xbal fragment, obtained from snut 
1465-Pstl-CCG are used like inserts. 

Piece8-gp41 : The strategy for building that piece is depicted in figure 20. RE digestion, 
10 DNA fragments purification, ligation as well as direct PCR screening of recombinant colonies 
were performed according the same procedures described for synBX08 gp160 gene. A 63 
bp linker is to be made according to the method described for snut 2425-ES, the minigene 
appraoch. Thus for 2 complementary oligonucleotides: 1265-gp41S(5- 
TCGAGgctagcGCCGTGGGCATCGGCGCTATGTTCCTCGGCTTCCTGGGCGctgca-3') and 
15 1265-gp41AS (5'-gCGCCCAGGAAGCCGAGGAAC- 

ATAGCGCCGATGCCCACGGCgctagcC-3' should be annealed together. This synthetic 
linker will be directly ligated into the Xho\ /Pst\ sites of piece8-klon13 from which the snut 
1265-Xhol-cl 2M1 would have been removed. 

20 piece7 : The strategy for building that piece is depicted in figure 21 . RE digestion, DNA 
fragments purification, ligation as well as direct PCR screening of recombinant colonies were 
performed according the same procedures described above for piecel, except the following: 

- The plasmid piece5-cl1 was linearised by Clal/Xhol and agarose gel purified, in order to be 
used as vector,. 

25 - A 798-bp Xhol/Sacll fragment, obtained from piece8-cl13 as well as a 140-bp Sacll/Clal 
fragment, obtained from 2060-Sacll-c!21 were agarose gel purified. 
-The ligation of the 3 fragments was performed using a vectoninsert ratio of 1:1, 1:2 or 1:5. 

- 50|il of competent XLIBlue bacteria were transformed with 1/1 0th of the ligation products 
according to the protocole described for piecel. 

30 - Direct colony PCR screening was performed using M13Reverse and T7 primers. 

Example 5: assembly of genes 

synBX08 gp160 gene : The strategy for building that gene is depicted in figure 6. RE 
digestion, DNA fragments purification, ligation as well as direct PCR screening of 
recombinant colonies were performed according the same procedures described for piecel. 
35 20 ng of the expression plasmid WRG7079 were digested by Nhel/BamHI. Plasmid DNA- 
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ends were dephosphorylated by Calf Intestin Phosphatase treatment (CIP, Biolabs) 
(Maniatis) to avoid autoligation of any partially digested vector. CIP enzyme was heat- 
inactivated and removed by classical phenol-chloroforme extraction. A 1277-bp Nhel/Xhol 
fragment, obtained from piece3-c!27, as well as a 1194-bp Xhol/BamHI fragment, obtained 
5 from piece7-cl1, were agarose gel purified. The ligation was performed using a vectoninsert 
ratio of 1:1 or 1:2. Fifty \x\ of competent XL1 Blue bacteria were transformed with 1/1 0th of the 
ligation product according to the protocole described for piece 1. After transformation bacteria 
were plated on LB-kanamycin agar plates. Direct PCR colony screening was performed 
using the primer set WRG-F (5-AGACATAATAGCTGACAGAC-3') and WRG-R (5'- 
10 GATTGTATTTCTGTCCCTCAC-3'). The nucleotide sequence was determined according the 
methods described above for piecel. 

synBX08 gp150 gene : The strategy for building that gene is depicted in figure 5. RE 
digestion, DNA fragments purification, ligation as well as direct PCR screening of 

15 recombinant colonies were performed according the same procedures described for 

synBX08 gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained from piece3-c!27, as well as 
a 800-bp Xhoi/BamHI fragment, obtained from piece 8-gp150-cl26, were agarose gel purified 
and then ligated into the Nhel/BamHI WRG7079 sites. The ligation was performed using a 
vector:insert ratio of 1:1 or 1:2. 

20 For construction of the synthetic BX08 gp1 50, piece4 was mutated to Piece4gp1 50 whereby 
a tyrosine -> cysteine was changed and a stop codon was introduced after the 
transmembrane spanning domaine ( TMD), followed by a BamHI cloning site. A new 
piece8gp150 was constructed composed of snut1265/snut1465/piece4gp150. 

25 synBX08 gp1S0/uncleaved gene : RE digestion, DNA fragments purification, ligation as 
well as direct PCR screening of recombinant colonies are performed according the same 
procedures described for synBX08 gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained 
from piece3-c!27, as well as a 800-bp Xhol/BamHI fragment, obtained from piece 8- 
gp150/uncieaved, are agarose gel purified and then are ligated into the Nhel/BamHI 

30 WRG7079 sites. The ligation was performed using a vectoninsert ratio of 1:1 or 1:2. 

synBX08 gp 140 gene : The strategy for building that gene is depicted in figure 4. RE 
digestion, DNA fragments purification, ligation as well as direct PCR screening of 
recombinant colonies were performed according the same procedures described for 
35 synBX08 gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained from piece3-cl27, as well as 
a 647-bp Xhol/BamHI fragment, obtained from piece 8-gp140-cl2, were agarose gel purified 
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and then ligated into the Nhel/BamHI sites of WRG7079. The ligation was performed using a 
vectorinsert ratio of 1 :1 or 1 :2. For construction of the synthetic BX08 gp140, piece4 was 
mutated to Piece4gp140 whereby a stop codon was introduced just before the TMD followed 
by a BamHI cloning site. A new piece8gp140 was constructed composed of 
5 snut1265/1465/piece4gp140. 

synBX08 gp140/uncleaved gene : RE digestion, DNA fragments purification, ligation as 
well as direct PCR screening of recombinant colonies are performed according the same 
procedures described for synBX08 gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained 
10 from piece3-cl27, as well as a 800-bp Xhol/BamHI fragment, obtained from piece 8- 

gp140/uncleaved, are agarose gel purified and then ligated into the Nhel/BamHI WRG7079 
sites. The ligation was performed using a vectorinsert ratio of 1:1 or 1:2. 

synBX08 gp 120 gene : The strategy for building that piece is depicted in figure 3. RE 
15 digestion, DNA fragments purification, ligation as well as direct PCR screening of 
recombinant colonies were performed according the same procedures described for 
synBX08 gp160 gene. A 1277-bp Nhel/Xhol fragment, obtained from piece3-cl27, as well as 
a 206-bp Xhol/BamHI fragment, obtained from 1265-Xhol-gp120-clM5, were agarose gel 
purified and then ligated into the Nhel/BamHI sites of WRG7079. The ligation was performed 
20 using a vectorinsert ratio of 1:1 or 1:2. For construction of the synthetic BX08 gp120, snut 
' 1265 was mutated to Si 2 65g P i2o to introduce a stop codon at the gp120/gp41 cleavage site 
followed by a BamHI cloning site. 

The gp160, gp150, gp140, and gp120 genes are cloned (Nhel-BamHI) and maintained in an 
25 eucaryotic expression vectors containing a CMV promoter and a tPA leader, but other 

expression vectors may be chosen based on other criteria e.g. antibiotic resistance selection, 
other leader sequences like CD5 etc, presence or not of immune stimulatory sequences etc. 

SynBX08 gp41 gene : The strategy for building that gene is depicted in figure 20. RE 
30 digestion, DNA fragments purification, ligation as well as direct PCR screening of 
recombinant colonies were performed according the same procedures described for 
synBX08 gp160 gene. Piece 8-gp41 is ligated with snut 2060-Sacll-klon21 and piece 5 as 
already decribed for the construction of piece 7, creating piece 7-gp41 (P7 gP 4i). Subsequently 
the piece 7-gp41 containing the entire gp41 gene will be cloned in WRG7079 using the Nhel 
35 and BamHI sites. 
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Example 6a: High expression by codon optimization 

To analyse the expression of glycoproteins from the wild type and synthetic BX08 envelope 
genes RIPA was performed on transfected mammalian cell lines. Both cell membrane 
associated and secreted HIV-1 glycoproteins from the cell supernatants were assayed. The 
5 envelope plasmids were transfected into the human embryonic kidney cell line 293 (ATCC, 
Rockville, MD) or the mouse P815 (H-2D d ) cell line using calcium phosphate (CellPhect 
Transfection kit, Pharmacia). For radio immune precipitation assay (RIPA), transfected cells 
were incubated overnight, washed twice and incubated for 1 hour with DMEM lacking 
cysteine and methionine (Gibco). Then the medium was replaced with medium containing 50 

10 pCi per ml of [^S] cysteine and 50 pCi/ml of [^SJ methionine (Amersham Int., Amersham, 
UK) and incubation continued overnight. Cells were centrifuged, washed twice with HBSS 
and lysed in 1 ml ice-cold RIPA buffer (10 mM Tris, pH 7.4, 150 mM NaCI, 50 mM EDTA, 1% 
Nonidet P-40, 0.5% sodiumdeoxycholate) to detect membrane bound Env glycoproteins. The 
cell lysates were centrifuged for 15 min. at 100,000 x g to remove any undissolved particles 

15 and 100 |J immune precipitated with protein A-sepharose coupled human polyclonal IgG 
anti-HIV antibodies (Nielsen et al., 1987). For analysis of secreted Env glycoproteins 500 nl 
of the 5 ml supernatants from transfected cells were incubated with protein A-sepharose 
coupled human polyclonal IgG anti-HIV antibodies. After washing three times in cold RIPA 
buffer and once in PBS, the immuno precipitates were boiled for 4 min. in 0.05 M Tris-HCI, 

20 pH 6.8, 2% SDS, 10% 2-mercaptoethanol, 10% sucrose, 0.01% bromophenol blue and 
subjected to SDS PAGE (Vinner et al., 1999). Electrophoresis was carried out at 80 mV for 1 
hour in the stacking gel containing 10% acrylamide, and at 30 mV for 18 hours in the 
separating gradient gel containing 5-15% acrylamide. Gels were fixed in 30% ethanol-10% 
acetic acid for 1 hour, soaked for 30 min. in En3Hance (Dupont #NEF 981), washed 2*15 

25 min. in distilled water, dried and autoradiography performed on Kodak XAR-5 film. 
Transfection of human 293 cells with the syn.gp120 B xo8 and syn.gp140 B xoe genes, 
respectively, resulted in high amounts of only secreted HIV-1 glycoproteins (Fig. 22a, lane 9 
and 8). Thus, the synthetic gene in the absence of rev expresses the HIV-1 surface 
glycoprotein of the expected size which is recognised by human anti-HIV-1 antisera. The 

30 expression of BX08 gp120 was Rev independent and with roughly the same high amount of 
gp120from the syn.gp120MN gene (Fig. 22a, lane 2). Fig. 22a, lane 6 and lane 7 shows the 
expression of only membrane bound gp160 and gp150 from 293 cells transfected with 
syn.gp160Bxo8 and syn.gp150 B xoe plasmids, respectively. Also transfection with wt.gp160 B xos 
plasmid resulted in a significant expression of membrane bound gp160 despite the absence 

35 of Rev (Fig. 22a, lane 3). Co-transfection with equimolar amounts of Rev encoding plasmid 
seemed to increase this expression somewhat (Fig. 22a, lane 4). This is seen despite the 



WO 00/29561 



41 



PCT/DKOO/00144 



lower transfection effectivity using two plasmids and the use of only half the amount of 
wt.gp160 B xo8 DNA when combined with pRev. The amounts of secreted HIV-1 glycoproteins 
from gp120 and gp141 accumulating in the cell supernatants seemed higher than the 
amounts of cell associated glycoproteins at the time of harvesting of the cells. Interestingly, 
5 the amounts of gp160 produced from the "humanized" gene were about equal to the 
amounts produced by the wt.gp160 B xoe + pRev genes, respectively (Fig. 22a t lane 4 and 6). 
The processing of gp160, gp150 and gp140 into gp120 plus a gp41, or fractions of gp41, 
produced from wild type or synthetic genes in the 293 cell-line did not function well under 
these experimental conditions. Same phenomenon was seen in RIPA from 293-CD4 cells 
10 and HeLa-CD4 cells infected by HIV-1 MN (Vinner et al., 19999). Because of the absence of 
CCR5 these cell-lines could, however, not be infected by HIV-1 strain BX08. 

Example 6b: Radio immuno precipitation assay (RIPA) of synthetic BX08 transfected 
cells showing expression of glycoproteins from synthetic BX08 env plasmids 

15 The synthetic envelope plasmid DNA were transfected into the human embryonic kidney cell 
line 293 (ATCC, Rockville, MD) using calcium phosphate (CellPhect Transfection kit, 
Pharmacia). For immune precipitation analysis, transfected 293 cells were treated and 
analysed according to the method described in example 6a. 
To analyse expression from these genes, an SDS-PAGE of the 35 S-labelled HIV-1 

20 envelopes, immune precipitated from the transfected cells is shown in figure 22b. Both cell- 
membrane associated and secreted HIV-1 envelope glycoproteins in the cell supernatants 
were assayed. Transfection of 293 cells with the synthetic BX08 gene encoding gp120 
(syn.gp120BX08) in lane 4, and syn.gp140BX08 (lane 5) that did not contain rev encoding 
regions, resulted in abundant amounts of HIV-1 gp120. Thus, the expressions were Rev 

25 independent and expressed in roughly same high amounts as the syn.gp120MN and 

syn.gp160MN genes (lane 3 and 2, respectively) already showed by our group and others to 
be markedly increased in comparisson with HIV MN wild type genes including rev (Vinner et 
al 1999). 

Transfection with syn.gp150 plasmid (lanes 7 and 8) resulted in significant expression of 
30 membrane associated gp120 and low detactable amounts of truncated form of gp41 (cell 
pellet in lane 7) with no detectable HIV-1 glycoprotein in the cell supernatant lane 8. It is 
concluded that the synthetic BX08 genes expres the envelope glycoproteins of expected size 
which are recognised by human anti-HIV-1 antiserum. 
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Exampel 6C FACS 

To quantitate the surface expression of HIV glycoproteins from the wild type and synthetic 
BX08 envelope genes transfection experiments were done and cell surface expression 
5 examined by FACS (flow cytometer). 

10 ^g of the BX08 envelope plasmid (wild type BX08gp160 or synBX08gp160) plus 10 ng of 
an irrelevant carrier plasmid pBluescript were used to transfect a 80-90% confluent layer of 
293 cells in tissue culture wells (25 cm 2 ) using the CellPect kit (Pharmacia). After 48 hours 

10 cells were Versene treated, washed and incubated with a mouse monoclonal IgG antibodies 
to HIV gp120 (NEA-9301 , NEN™, Life-Science Products Inc., Boston) for time 30 min. on wet 
ice followed by washing in PBS, 3% FCS and incubation with Phyto-Erytrin (PE) labelled rat 
anti-mouse lgG1 (Cat #346270, Becton Dickinson) according to the manufacture. After 
washing the cells were fixed in PBS, 1% paraformaldehyd, 3% FCS, and analysed on a 

15 FACS (FACScan, Becton-Dicknsson). Table 5 show in duplicate expression of BX08 gp160 
from 1 1 % of the cells transfected with wild type BX08 (number 1 and 2) compared to the 48 
% of cells expression BX08 glycoprotein when transfected with the synthetic gene (number 3 
and 4). Thus, a several fold higher expression is obtained using the synthetic BX08 gene. 

20 Table 5 FACS analysis of 293 cells transfected with synBX08gp160 (No 1 and 2) and 
wtgp16+BX08 (No3 and 4) and stained with monoclonal antibodies to surface expressed 
HIV glycoproteins. A higher expression was obtained with the synthetic gene (mean 48%) as 
compared to the wild type gene (mean 1 1 %). 



50 ul 


45 ul 


A 


B 


C 


C-A 


C-B 


1 syn.gp160BX08 + 


pBluescript SK+ 


2,57 


2,85 


36,91 


34,34 


34,06 


2 syn.gp160BX08 + 


pBluescript SK+ 


2,83 


2,14 


58,42 


55,59 


56,28 


3 wt.gp160BX08 + 


pBluescript SK+ 


1,95 


1,52 


7,51 


5,56 


5,99 


4 wt.gp160BX08 + 


pBluescript SK+ 


2,97 


1.42 


14,41 


11,44 


12,99 



A: No primary antibody added (control for unspecific secondary Ab binding) 
25 B: Neither Primary Ab nor Secondary Ab added (autoflouroscense control) 
C: Primary Ab and secondary Ab added. 



Example 6D Analyses of the surface expression and biological functionality 

To analyse the surface expression and biological functionality from the wild type and 
30 synthetic BX08 envelope genes transfection experiments were done and cell fusion 
microscopically studied using HIV envelope receptor expressing cells. 
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10 ng of the BX08 envelope plasmid (wt.BX08gp160 or syn.BX08gp160 or empty WRG7079 
vector plasmid) plus 5 \ig of a plasmid (pEGFP, Clonetech) expressing green fluorescent 
protein (GFP) were transfected into 2 x 10 6 adherent U87.CD4.CCR5 cells (NIH AIDS Res. & 
Reference program, catalog #4035) stabely expressing CD4 and CCR5, using the CellPhect 
5 transfection kit (Pharmacia). After 48 hours the cells were examined by microscopy and 
photographed (Fig 22c). 

Fig 22c panel A show the negative control (empty WRG7079 plus pGFP) giving no syncytia. 
Panel B show cells transfected with the wild type BX08 gp160 plasmid where cell-to-cell 
fusion (syncytia) is seen. Panel C show cells transfected with the same amounts of 
10 synBX08gp160 plasmid and demonstrating a much higher degree of cell-cell fusion. In fact 
most or all of the cells in the culture plate were fused at this time. This experiment show 
surface expression of functional HIV gp160 with tropism to the CCR5 receptor, as well as a 
much higher expression and biological activity from the synthetic BX08 gene as compared to 
the wild type equivalent. 

15 

Example 7: Gene inoculation of mice for immunization 

6-7 weeks old female BALB/c mice were purchased from Bomholdtgaard, Denmak. 
Microbiological status was conventional and the mice were maintained in groups of 4/5 per 
cage with food and water ad libitum and artificially lighted 12 hours per day. Acclimatization 

20 period was 2 days. Mice were anaesthetized with 0.2 ml i.p. of rohypnol:stesolid (1:3, v/v) 
and DNA inoculated by either i.m. injection of 50 ^l 2 mg/ml of plasmid DNA in each tibia 
anterior muscle at week 0, 9, and 15 and terminated week 18; or gene gun inoculated on 
shaved abdominal skin using plasmid coated gold particles (0.95 \xm particles, 2 ^g DNA/mg 
gold, 0.5 mg gold/shot, 50-71% coating efficiency) with the hand held Helios® gene gun 

25 device (BioRad) employing compressed (400 psi) Helium as the particle motive force. Mice 
were gene gun vaccinated at week 0, 3, 6, 9, 15, and terminated week 18. 

Example 8: Serological assays 

Western blotting. The induction of a humoral response to gp120 and gp41 antigens by in 
30 vivo expression of the encoded glycoproteins from the synthetic BX08 genes was examined 
by western immuno blotting (Figure 27). Mouse antisera (1:40) were evaluated in western 
blotting using the commercial HIV BLOT 2.2 strips (Genelabs Diagnostic). The conjugate 
was a 1:200 dilution of the alkaline phosphatase-conjugated rabbit anti-mouse IgG 
(Dakopatts, Glostrup, Denmark). Buffers, incubation condition and colour development were 
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used according to the manufacturer. In these western blotting strips the gp160 band from 
HIV-1 MB contain of an oligomeric form of gp41 in a higher concentration than the 
monomericgp41 band on the strip (Genelabs Diagnostic). HIV-1 mB lysate is used in these 
commercial strips where the gp160 band is composed by addition of tetrameric gp41.AII 
5 preimmune sera tested negative in western blotting. Mice inoculated with syn.gp120 B xo8 
showed antibody response to the heterologous gp120 of HIV-1 IIIB. Inclusion of the 
extracellular part of gp41 in the gene syn.gp140 B xo8 induced antibody reaction to both gp120 
and gp41 in all mice. This confirms the in vivo expression of BX08 gp120 and the 
extracellular part of gp41. DNA vaccination with syn.gp160 B xos encoding the membrane 
10 bound glycoprotein induced antibodies to gp120 and gp41 in 50% and 64% of the mice, 
respectively. DNA vaccination with syn.gplSOaxoe induced detectable antibodies to gp120 
and gp41 in 41% and 53%, respectively. Induction of different levels of antibodies could 
explain the difference in numbers of positive reactive mice sera in this qualitative western 
blotting. 

15 ELISA. Mouse anti HIV-1 gp120 antibodies were measured by indirect ELISA. Briefly, wells 
of polystyrene plates Maxisorb (Nunc) were coated for 2 days at room temperature with HIV- 
1 IIIB recombinant gp120 (Intracel) at 0.2 ^ig/100 ^l of carbonate buffer, pH 9.6. Before use 
the plates were blocked 1 hour at room temperature with 150 ^l/well of washing buffer (PBS, 
0.5 M NaCI, 1% Triton-X-100) plus 2% BSA and 2% skimmilk powder. After 3 x 1 min. 

20 washings, mouse plasma was added at 100 ^l/well diluted in blocking buffer and ELISA 
plates incubated for 90 min. at room temperature using a microtiter plate shaker. As standard 
curve we used a mouse monoclonal antibody to a conserved part of gp120 between V5-C5 
(MRDNWRSELYKY) (#NEA-9301, NEN™ Life Science Products, Inc., Boston, MA). As 
calibration control included on each plate we used a plasma pool from 10 mice vaccinated 

25 with BX08 gp120. Plates were again washed 5x1 min. and incubated 1 hour at room 
temperature with 100 nl/well of HRP-conjugated rabbit anti-mouse IgG (#P260, Dakopatts, 
Giostrup, Denmark) diluted 1:1000 in blocking buffer. Colour was developed with 100 nl/well 
of peroxidase enzyme substrate consisting of 4 mg of o-phenylenediamine in 1 1 mi water 
plus 4 ^l hydrogen peroxide (30%, w/w). The enzyme reaction was terminated after 30 min. 

30 by 150 ^il/well of 1M H 2 S0 4 . The optical density (OD) of wells was measured at 492 nm using 
a microplate photometer (Molecular Devices, Biotech-Line, Denmark). Anti-HIV-gp120 IgG 
titers were expressed as the reciprocal plasma dilution resulting in an OD 49 2nm value of 0.500. 
Mouse anti-HIV-1 BX08 antibodies were also measured by indirect peptide ELISAs as 
described above using a BX08 V3 peptide (SIHIGPGRAFYTTGD) (Schafer, Copenhagen, 

35 Denmark). 
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The IgG antibody response to HIV-1m B rgp120 quantitated by ELISA is seen in Fig. 28 
and Fig. 29. No background activity was observed in preimmune sera or in sera from 4 mice 
immunized with empty WRG7079 vector in parallel with the BX08 genes. All mice inoculated 
with the synthetic BX08 genes either by gene gun or by i.m. injection responded and showed 
5 a persistent and high titered (about 100-10,000) IgG response to rgp120 as exemplified in 
Fig. 4. When comparing the median titers for groups of mice (Fig. 29) a moderate antibody 
response was observed with the wt.gp160 BX oe. Intramuscular and gene gun immunization 
with a mixture of wt.gp160 B xos plasmid plus Rev encoding plasmid did not increase this 
antibody response. This was found even when both plasmids were coated onto the same 

10 gold particles to ensure co-transfection of single target cells. However, to ensure inoculation 
of equal amounts of total DNA only half of the amount of wt.BX08 plasmid was used when 
mixing with pRev which may have contributed to the lower antibody response when pRev 
was included. A 5-foid improvement of the antibody response was obtained using the 
syn.gp160 B xo8 gene. This antibody response seemed further improved using the 

1 5 syn.gpl 50 B xoe gene where the cytoplasmic internalization signals were eliminated but only 
using gene gun inoculation. For both the gene gun inoculation of skin and i.m injection the 
highest antibody titers to rgp120 were induced by genes encoding secreted gp120/gp140 
glycoproteins versus membrane bound gp150/gp160 glycoproteins, respectively. In general, 
equal antibody and ELISA titers to rgp120 were obtained using gene gun and i.m. injection of 

20 the BX08 vaccine genes. 

Example 9: Neutralization assay 

Mouse plasma was diluted in culture medium (RPMI-1640 medium (Gibco) supplemented 
with antibiotics (Gibco), Nystatin (Gibco) and 10% FCS (Bodinco)) and heat inactivated at 
60°C for 30 min. Of the HIV-1 strain BX08 (50 TCID50 per ml propagated in PBMC) 250 pi 

25 was incubated for 1 hour at room temperature with 250 pi dilution of mouse serum (four five- 
fold dilutions of mouse serum, final dilutions 1:20 to 1:2500). After incubation 1 x 10 6 PBMC 
in 500 pi culture medium was added to the virus-serum mixture and incubated overnight at 
37°C in 5% CG 2 . Subsequently, eight replicates of 10 5 PBMC in 200 pi culture medium were 
cultured in 96-well culture plates (Nunc) at 37°C in 5% C0 2 . After seven days in culture the 

30 concentration of HIV antigen in the culture supernatant was quantitated using HIV antigen 
detection ELISA (Nielsen et al. ( 1987). 

This ELISA is performed using human IgG, purified from high titered patient sera, both as 
capture antibody and biotin-linked as detecting antibody. In brief, anti-HIV-capture IgG diluted 
1:4000 in PBS, 100 pl/well, are coated onto Immunoplates (Nunc) overnight at4 e C. After 
35 washing five times in washing buffer 1 00 ^ of supernatants are applied and incubated overnight 
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at 4°C. Plates are washed 5 times before incubation with 100 nl HIV-IgG conjugated with biotin 
diluted 1:1000 in dilution buffer, plus 10% HIV-1 seronegative human plasma for 3 hours at 
room temperature. Five times 1 min. washing in washing buffer are followed by 30 min. 
incubation with 100 nl of 1:1000 avidine-peroxidase (Dako P347 diluted in dilution buffer). Six 
5 times 1 min. washings, 5 in washing buffer and the last one are done in dH 2 0 before colour is 
developed with 100 ^l of peroxidase enzyme substrate consisting of 4 mg of OPD in 1 1 ml 
water plus 4 yi\ hydrogen peroxide (30 %, w/w). The enzyme reaction is terminated after 30 
minutes by additional 150 \x\ of 1M H 2 S0 4 . 

The HIV antigen concentration in cultures, preincubated with mouse serum, was expressed 
10 relatively to cultures without mouse serum (culture medium), and the percentage inhibitions 
of the different dilutions of mouse serum were calculated. The 50% inhibitory concentration 
(IC 50 ) for each mouse serum was determined by interpolation from the plots of percent 
inhibition versus the dilution of serum, and the neutralizing titer of the serum was expressed 
as the reciprocal value of the IC 50 . in each set-up a human serum pool known to neutralise 
15 other HIV-1 strains was included in the same dilutions as the mouse serum as a calibratin 
control. For assay of neutralization of the heterologous SHIV89.6P the MT-2-cell-killing 
format was used (Crawford et aL, 1999). The assay stock of SHIV89.6P was grown in human 
PBMC. 

The neutralizing IC 50 antibody titers of plasma pools from 10 mice from each group 
20 were measured at different time points (week 0, 9, and 18). A positive background in some 
preimmune sera and thus in all week 0 serum pools was noted even after dilution and heat 
inactivation that was found earlier to lower this background. In general the neutralizing titers 
to BX08 virus of such serum pools were transient and low ranging from 1:6-1:150 above 
background (data not shown). A possible cross-neutralization reaction to a heterologous, 
25 primary HIV-1 envelope was tested using the SHIV89.6P which is relevant in macaque 
models of AIDS and serum pools from mice DNA immunized i.m. with syn.gp140 B xos. 
Preimmune serum had a titer of 1:37, which is indicative of a slightly positive background, 
whereas the 18 week p.i. serum had a positive neutralizing titer of 1:254 above background. 

30 Example 10: CTL assay 

The cellular immune response in mice following gene gun or i.m. genetic immunization with 
the different vaccine plasmids were examined (Fig. 26). Spleen was removed aseptically and 
gently homogenised to single cell suspension, washed 3 times in RPMI-1640 supplemented 
with 10% FCS and resuspended to a final concentration of 5 x 10 7 cell/ml. The cells were 
35 then incubated 5 days with mitomycin-C treated (50 ng/ml for 1 hour) mouse P815 (H-2D d ) 
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stimulator cells at a ratio of 10:1 in medium supplemented with 5 x 10* 5 M p- 
mercaptoethanol. For assay of CTL response to HIV-1 BX08, P815 stimulator cells and 
target cells were pulsed with 20 ng/ml of the HIV-1 BX08 V3 peptide containing a conserved 
murine H-2D d restricted CTL epitope (IGPGRAFYTT) (Lapham et al M 1996). After 
5 stimulation, splenocytes were washed three times with RPMI-1640 supplemented with 10% 
FCS and resuspended to a final concentration of 5 x 10 6 cells/mi. 100 jxl of cell suspension 
was added in triplicate to U-bottom 96-well microtiter plates and a standard 4 hour 51 Cr- 
release assay performed (Marker et al., 1973). 

All synthetic BX08 plasmids induced a high specific CTL response thus confirming the in vivo 
10 expression and in vivo immunogenicity. The highest CTL response was obtained with 

syn.gp150 BX 08 followed by syn.gp12O 0X o8-syn.gp14OBxo8, and syn.gp160 BX 08, respectively. 

Thus, the CTL response induced did not correlate with the antigen being secreted or not. 

However, i.m. DNA immunization with syn.gplSOBxoa containing six putative CpG motifs 

induced a higher CTL response than gene gun immunization (Fig. 26). This difference could 
15 be explained by the high amount of DNA used in the i.m. injections. 

The T-lymphocyte cytokine profile of spleen cells after ConA stimulation as well as serum 

antibody lgG 2a /lgGi at week 18 were investigated. Neither the IFNy/IL-4 nor the IgG^IgG! 

ratios, which both reflects a Th1-type of immune response, were significantly higher for the 

i.m. immunized mice when compared with gene gun immunized mice (student t-test and 
20 Mann-Withney U-test). Thus, the CTL response did not correlate with a certain Th-type of 

response and the DNA immunization technique did not bias the immune response using 

synthetic BX08 genes. 

Example 11: Antibody responses to DNA vaccination with synBX08 env plasmid 
25 A relatively low and variable antibody response (1 of 10 mice) was obtained with gene gun 

inoculation of the syn.gp140BX08 plasmid vaccine starting at week 9, figure 23, right panel. 

A higher numbers of responders 3/10 with high lgG1 antibody responses at an earlier onset 

(week 3-9) was obtained with the syn.gp140BX08 plasmid using i.m. injection, left panel. 

Sera from later time points may show more responders and/or higher titers but are not 
30 assayed. However, these results show the induction of an antibody response to the BX08 V3 

peptide by DNA vaccination using one of the described synthetic BX08 constructs. 
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Claims 

1 . A method for producing a nucleotide sequence construct comprising the following steps: 

a) obtaining a first nucleotide sequence of an HIV gene from a patient within the first 12 
5 months of infection; 

b) designing a second nucleotide sequence utilising the most frequent codons from 
mammalian highly expressed proteins to encode the same amino acid sequence as the 
first nucleotide sequence of a) encodes 

c) redesigning the second nucleotide sequence of b) so that restriction enzyme sites 

10 surrounds the regions of the nucleotide sequence which encode functional regions of the 
amino acid sequence and so that selected restriction enzyme sites are removed thereby 
obtaining a third nucleotide sequence encoding the same amino acid sequence as the 
first and the second nucleotide sequence of a) and b) encode; 

d) redesigning the third nucleotide sequence of c) so that the terminal snuts contain 
15 convenient restriction enzyme sites for cloning into an expression vehicle; 

e) producing the snuts between restriction enzyme sites of c) and terminal snuts of d); 

f) assembling the snut of step e) to form a nucleotide sequence construct. 

2. A method according to claim 1 t wherein the HIV gene is the gene encoding the envelope. 

20 

3. A method according to claim 1 or 2, wherein the HIV gene encodes one or more Gag 
proteins. 

4. A method according to any of the preceding claims, wherein the HIV in step a) is in group 
25 M, O or N 

5. A method according to claim 4, wherein the HIV is a group M virus. 

6. A method according to any of the preceding claims, wherein the HIV is subtype A, B, C, D, 
30 E, F, G, H, I, or J. 

7. A method according to claim 6, wherein the HIV is subtype B. 

8. A method according to any of the preceding claims wherein the first nucleotide sequence 
35 is obtained by direct cloning. 
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9. A method according to any of the preceding claims, wherein the HIV in step a) is isolated 
with the first 11 months of infection, such a 10, 9, 8, 7, 6, 5, 4, 3, 2, 1. or 0.5 month after 
infection. 

5 

10. A method according to any of the preceding claims, wherein the redesigning in step c) is 
carried out after the second nucleotide sequence of step b) has been divided into pieces, 
so that each piece comprises only different restriction enzyme sites. 

10 11. A method according to claim 10, wherein the second nucleotide sequence of step b) is 
divided into 9 pieces, or 8, or 7, or 6, or 5, or 4, or 3, or 2 pieces. 

12. A method according to claim 1 1, wherein the second nucleotide sequence of step b) is 
divided into 3 pieces. 



15 



20 



13. A method according to any of the preceding claims, wherein the second nucleotide 
sequence of step b) is designed utilising the most frequent codons from human highly 
expressed proteins to encode the same amino acid sequence as the first nucleotide 
sequence of step a) encodes. 

14. A nucleotide sequence construct obtainable by the method of any of claims 1-13. 



15. A nucleotide sequence construct according to claim 14, wherein the nucleotide sequence 
encoding the amino acid sequence in the first variable region is surrounded by EcoRV 

25 and Pst\ restriction enzyme sites. 

16. A nucleotide sequence construct according to claims 14 or 15, wherein the nucleotide 
sequence encoding the amino acid sequence in the second variable region is surrounded 
by Pst\ and C/al restriction enzyme sites. 



30 



17. A nucleotide sequence construct according to any of claims 14-16, wherein the 
nucleotide sequence encoding the amino acid sequence in the third variable region is 
surrounded by C/al and EcoRI restriction enzyme sites. 
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18. A nucleotide sequence construct according to any of claims 14-17, wherein the 

nucleotide sequence encoding the amino acid sequence in the transmembrane spanning 
region is surrounded by HindlU and Sacll restriction enzyme sites. 

5 19. A nucleotide sequence construct according to any of claims 14-18, wherein the 

nucleotide sequence encoding the amino acid sequence on both sites of the cleavage site 
is surrounded by Pstt and Xba\ restriction enzyme sites, 

20. A nucleotide sequence construct in isolated form which has a nucleotide sequence with 
10 the general formula (I), (II), (III), or (IV) or subsequences thereof 

(I) PrS495ClarS650-720EcoRrP2-Si265gp120 

(II) PrS495ClarS650-72OEcoRrP2-S 12 65Xhor S^spstp P 4 gp 14 o 

(III) Pl-S495ClarS 6 60-720EcoRrP2-Sl285Xhor Si 4 65Pstr P4gp150 

(IV) Pi-S 4 95aarS65o-720EcoRrP2-S 12 65Xhor S^sspstr P4gpi6D* S206osacir Ps 

15 wherein designates the nucleotide sequence SEQ ID NO:41, a nucleotide sequence 

complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
thereto; 

wherein S49sciai designates the nucleotide sequence SEQ ID NO: 7, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 
20 95%thereto; 

wherein S 650 -72oecori designates the nucleotide sequence SEQ ID NO: 9, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 95% thereto; 

wherein P 2 designates the nucleotide sequence SEQ ID NO: 43, a nucleotide sequence 
25 complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein S 126 5gpi2o designates the nucleotide sequence SEQ ID NO: 19, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 70% thereto; 

30 wherein S^sxhoi designates the nucleotide sequence SEQ ID NO: 17, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 80% 
thereto; 

wherein S 146 5Psti designates the nucleotide sequence SEQ ID NO: 23, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 90% 
35 thereto; 
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wherein P^puo designates the nucleotide sequence SEQ ID NO: 57 t a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P 4gp i5o designates the nucleotide sequence SEQ ID NO: 55, a nucleotide sequence 
5 complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

wherein P 4g p, 60 designates the nucleotide sequence SEQ ID NO: 53, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 
thereto; 

10 wherein S 2 <>6osacii designates the nucleotide sequence SEQ ID NO: 33, a nucleotide 

sequence complementary thereto, or a nucleotide sequence with a sequence identity of at 
least 98% thereto; and 
wherein P 5 designates the nucleotide sequence SEQ ID NO: 59, a nucleotide sequence 
complementary thereto, or a nucleotide sequence with a sequence identity of at least 85% 

15 thereto. 

21. A nucleotide sequence construct according to claim 20, with the formula (I) 

(I) Pl-S495darS650-720EcoRrP2-Si265gp120 

20 22. A nucleotide sequence construct according to claim 20, with the formula (II) 

(II) Pl-S495CtarS650-720EcoRrP2"Si265Xhor S-weSPstr P4gp140 

23. A nucleotide sequence construct according to claim 20, with the formula (III) 

(III) Pl-S 4 95ctarS650-720EcoRrP2"Si265Xhor Si465Pstr P4gp150 

25 

24. A nucleotide sequence construct according to claim 20, with the formula (IV) 

(IV) PrS495C1arS650-720EcoRrP2 - Si265Xhor Si465Psir P4gp160~ S 2 O60Sacir P5 

25. A nucleotide sequence construct according to claim 20 consisting essentially of the 
30 subsequence P^. 

26. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S 4 g 5 ciai- 

35 27. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence Ssso^oecori- 
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28. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P 2 . 

5 29. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S 12 65g P i2o- 

30. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S 12e5X hoi- 

10 

31. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S 148 5Psti. 

32. A nucleotide sequence construct according to claim 20 consisting essentially of the 
15 subsequence P 4gp i 40 . 

33. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P 4gp15 o. 

20 34. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P 4gp i6o. 

35. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S 2a6 osacii 

25 

36. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence P 5 . 

37. A nucleotide sequence construct with a sequence identity of more than 85% to the 
30 nucleotide sequence construct in any of claims 20-35. 

38. A nucleotide sequence construct according to claim 37, wherein the sequence identity is 
more than 90% such as more than 95%, 98%, or 99%. 

35 39. A nucleotide sequence construct according to claim 37, wherein the sequence identity is 
100%. 
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40. A nucleotide sequence construct according to any of claims 14-39, coding for an HIV 
envelope or parts thereof with an improved immunogenicity obtained by mutating the 
nucleotide sequence construct of any of claims 14-39 such that one or more glycosylation 

5 sites in the amino acid sequence have been removed. 

41 . A nucleotide sequence construct according to claim 40 with a mutation at positions 
A307C + C309A and/or A325C + C327G and/or A340C + C342A and/or A385C + C387A 
and/or A469C + C471A or any combination of those. 

10 

42. A nucleotide sequence construct according to any of claims 14-41, coding for an HIV 
envelope or parts thereof with a binding site for the CXCR4 co-receptor in the third 
variable region. 

15 43. A nucleotide sequence construct according to claim 42 with a mutation at positions 
G865C + A866G. 

44. A nucleotide sequence construct according to any of claims 14-43, coding for an HiV 
envelope or parts thereof, wherein an immunodominant epitope has been modified. 

20 

45. A nucleotide sequence construct according to claim 44, wherein an immunodominant 
epitope in the third variable region has been modified. 

46. A nucleotide sequence construct according to claim 45 with a deletion of nucleotides 
25 793-897. 

47. A nucleotide sequence construct according to claim 44, wherein an immunodominant 
epitope has been removed from gp41. 

30 48. A nucleotide sequence construct according to any of claims 14-47, coding for an HIV 
envelope or parts thereof, wherein the cleavage site between gp41 and gp120 is 
removed. 

49. A nucleotide sequence construct according to claim 48 with a mutation at position 
35 C1423A. 
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50. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence Pi, S4 9 5ciai. S 6 so.72oecori. and P 2 . 

51 . A nucleotide sequence construct according to claim 20 consisting essentially of the 
5 subsequence Si^sxhoi, Suespstr, and P4g P i4o- 

52. A nucleotide sequence construct according to claim 20 consisting essentially of the 
subsequence S^esxnoi. S^espsti. P4gpieo. S^osaciw and P5. 

10 53. A nucleotide sequence construct according to any of claims 14-52, further comprising a 
nucleotide sequence repeat coding for a functional region of the amino acid sequence. 

54. A nucleotide sequence construct according to claim 53, wherein the nucleotide sequence 
repeat codes for amino acids in the third variable region. 

15 

55. A nucleotide sequence construct according to any of claim 14-54, further comprising a 
nucleotide sequence coding for a T-helper cell epitope containing sequence. 

56. An expression vehicle selected from a group of viral vectors consisting of simliki forest 
20 virus (sfv), adenovirus and Modified Vaccinia Virus Ankara (MVA), further comprising a 

nucleotide sequence construct according to any of claim 14-55. 

57. A method of individualised immunotherapy wherein the virus from a newly diagnosed 
patient is directly cloned, the envelope is produced with highly expressed codons, inserted 

25 into any of the nucleotide sequence constructs of claims 14-55, and administered to the 
patient. 

58. Use of a nucleotide sequence construct according to any of claims 14-55 in medicine. 

30 59. Use of a nucleotide sequence according to any of claims 14-55 for the manufacture of a 
vaccine for the prophylactics of infection with HIV in humans. 



35 



60. Use of a nucleotide sequence according to any of claims 14-55 for the manufacture of a 
composition for the treatment of an HIV infection in a human within 24 weeks of primary 
infection. 
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61 . Use of the nucleotide sequence according to any of claims 14-55 for the production of a 
recombinant protein. 
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Val Trp Lys Asp 
20 

Ala Tyr Asp Thr 
35 

Pro Thr Asp Pro 
50 

Asn Phe Asn Met 

65 

He 



Ala Thr Thr Thr 

Glu Val His Asn 
40 

Asn Pro Gin Glu 
55 

Gly Lys Asn Asn 
70 



Leu Phe Cys Ala 
25 

Val Trp Ala Thr 

Val Val Leu Gly 
60 

Met Val Glu Gin 
75 



Ser Asp Ala Lys 
30 

*His Ala Cys Val 
45 

Asn Val Thr Glu 

Met His Glu Asp 
80 



<210> 3 
<211> 143 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (143) 



<400> 3 

gat ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg 

Asp He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu 

15 io 15 

acc ccc ctg tgc gtg acc ctg aac tgc ace aag ctg aag aac age acc 

Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr 

20 25 30 



48 



96 



gac acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc ag 143 
Asp Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys 
35 40 45 



<210> 4 
<211> 47 
<212> PRT 

<213> Artificial Sequence 



<400> 4 

Asp He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu 

15 10 15 

Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr 

20 25 30 

Asp Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys 
35 40 45 



<210> 5 
<211> 132 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (132) 



<400> 5 

ctg cag ctt caa cat cag cac cag cgt gcg caa caa gat gaa gcg cga 
Leu Gin Leu Gin His Gin His Gin Arg Ala Gin Gin Asp Glu Ala Arg 
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10 15 



gta cgc cct gtt eta cag cct gga cat cgt gec cat cga caa cga caa 

Val Arg Pro Val Leu Gin Pro Gly His Arg Ala His Arg Gin Arg Gin 

20 25 30 

cac cag eta ccg cct gcg cag ctg caa cac ate gat 

His Gin Leu Pro Pro Ala Gin Leu Gin His He Asp 

35 40 



96 



132 



<210> 6 
<211> 44 
<212> PRT 

<213> Artificial Sequence 
<400> 6 

Leu Gin Leu Gin His Gin His Gin Arg Ala Gin Gin Asp Glu Ala Arg 

15 10 15 

Val Arg Pro Val Leu Gin Pro Gly His Arg Ala His Arg Gin Arg Gin 

20 25 30 

His Gin Leu Pro Pro Ala Gin Leu Gin His He Asp 
35 40 

<210> 7 
<211> 161 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1)...(161) 

<400> 7 

ate gat cat cac cca ggc 

He Asp His His Pro Gly 
1 5 

cat cca ctt ctg cgc ccc 
His Pro Leu Leu Arg Pro 
20 

caa gac ctt caa egg cac 
Gin Asp Leu Gin Arg His 
35 

gtg cac cca egg aat tc 
Val His Pro Arg Asn 
50 



ctg ccc caa ggt 
Leu Pro Gin Gly 
10 

cgc egg ctt cgc 
Arg Arg Leu Arg 
25 

egg ccc ctg cac 
Arg Pro Leu His 
40 



gag ctt cga gee 
Glu Leu Arg Ala 



cat cct gaa gtg 
His Pro Glu Val 
30 

caa cgt gag cac 
Gin Arg Glu His 
45 



cat ccc 4 8 

His Pro 
15 

caa caa 96 
Gin Gin 



cgt gca 14 4 

Arg Ala 



161 



<210> 8 
<211> 53 
<212> PRT 

<213> Artificial Sequence 



<400> 8 

He Asp His His Pro Gly Leu Pro Gin Gly Glu Leu Arg Ala His Pro 
15 10 15 
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His Pro Leu Leu Arg Pro Arg Arg 
20 

Gin Asp Leu Gin Arg His Arg Pro 

35 40 
Val His Pro Arg Asn 
50 



Leu Arg His Pro Glu Val Gin Gin 
25 30 
Leu His Gin Arg Glu His Arg Ala 
45 



<210> 9 
<211> 254 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (254) 



<400> 9 

gaa ttc gcc ccg tgg tga gca ccc age tgc tgc tga acg gca gec tgg 4 8 

Glu Phe Ala Pro Trp * Ala Pro Ser Cys Cys + Thr Ala Ala Trp 
15 10 



ccg agg agg agg tgg tga tea gat ctg aga act tea cca aca acg cca 96 
Pro Arg Arg Arg Trp * Ser Asp Leu Arg Thr Ser Pro Thr Thr Pro 
15 20 25 



aga cca tea teg tgc age tga acg aga gcg tgg aga tea act gca ccc 14 4 

Arg Pro Ser Ser Cys Ser * Thr Arg Ala Trp Arg Ser Thr Ala Pro 
30 35 40 



gcc cca aca aca aca ccc gca aga gca tec aca teg gcc ctg gcc gcg 192 
Ala Pro Thr Thr Thr Pro Ala Arg Ala Ser Thr Ser Ala Leu Ala Ala 
45 50 55 60 



cct tct aca cca ccg gcg aca tea teg gcg aca tec gcc agg ccc act 
Pro Ser Thr Pro Pro Ala Thr Ser Ser Ala Thr Ser Ala Arg Pro Thr 

65 70 75 



240 



gca aca tct eta ga 
Ala Thr Ser Leu 
80 



<210> 10 
<211> 80 
<212> PRT 

<213> Artificial Sequence 
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Ala 


Arg 
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Ser 
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Ala 


Ala 


Pro 
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Pro 




50 










55 
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Ala 
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Ser 
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Thr 


Ser 


Ala 


Arg 
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Ser 
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65 










70 










75 










80 
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<210> 


11 






<211> 


92 






<212> 


DNA 






<213> 


Artificial Sequence 




<220> 

^ v/ ^ 










CDS 






<222> 


(1) . 


. . (92) 




<400> 


11 




tct 


aga acc 


aac 


tgg acc aac acc 


Ser 


Arg Thr 


Asn 


Trp Thr Asn Thr 








5 


cgc 


gag aag 


ttc 


aac aac acc acc 


Arg 


Glu Lys 


Phe 


Asn Asn Thr Thr 






20 





:tg aag cgc gtg gcc gag aag ctg 4 8 

jeu Lys Arg Val Ala Glu Lys Leu 

10 15 

itc gtg ttc aac cag age tc 92 

:ie Val Phe Asn Gin Ser 

25 30 



<210> 12 
<211> 30 
<212> PRT 

<213> Artificial Sequence 



<400> 12 

Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu 

15 10 15 

Arg Glu Lys Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser 
20 25 30 



<210> 13 
<211> 130 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1) . . . (130) 



<400> 13 

gag etc egg egg cga ccc cga gat cgt gat gca cag ctt caa ctg egg 4 8 

Glu Leu Arg Arg Arg Pro Arg Asp Arg Asp Ala Gin Leu Gin Leu Arg 
15 10 15 



egg cga gtt ctt eta ctg caa cac cac cca get gtt caa cag cac ctg 96 

Arg Arg Val Leu Leu Leu Gin His His Pro Ala Val Gin Gin His Leu 

20 25 30 

gaa cga gac caa cag cga ggg caa cat cac tag t 130 

Glu Arg Asp Gin Gin Arg Gly Gin His His * 

35 40 



<210> 14 
<211> 42 
<212> PRT 

<213> Artificial Sequence 



<400> 14 
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Glu Leu Arg Arg Arg Pro Arg Asp Arg Asp Ala Gin Leu Gin Leu Arg 

15 10 15 

Arg Arg Val Leu Leu Leu Gin His His Pro Ala Val Gin Gin His Leu 

20 25 30 

Glu Arg Asp Gin Gin Arg Gly Gin His His 
35 40 



<210> 


15 


<211> 


164 


<212> 


DNA 


<213> 


Artificial 


<220> 




<221> 


CDS 


<222> 


(1) . . . (164) 


<400> 


15 


agt ggc 


acc ate acc 


Ser Gly 


Thr He Thr 



10 15 



48 



atg tgg cag gag gtg ggc aag gec atg tac gec ccc ccc ate ggc ggc 96 
Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro He Gly Gly 
20 25 30 

cag ate aag tgc ctg age aac ate acc ggc ctg ctg ctg acc cgc gac 144 
Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp 
35 40 45 

ggc ggc age gac aac teg ag 164 
Gly Gly Ser Asp Asn Ser 
50 



<210> 16 
<211> 54 
<212> PRT 

<213> Artificial Sequence 





<400> 


16 
























Thr 


Ser Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys Arg 


He 


Lys 


Gin 


lie 


lie 


Asn 


1 






5 








10 










15 




Met 


Trp Gin 


Glu 
20 


Val 


Gly 


Lys 


Ala 


Met Tyr 
25 


Ala 


Pro 


Pro 


lie 
30 


Gly 


Gly 


Gin 


He Lys 
35 


Cys 


Leu 


Ser 


Asn 


He 
40 


Thr Gly 


Leu 


Leu 


Leu 
45 


Thr 


Arg 


Asp 


Gly 


Gly Ser 
50 


Asp 


Asn 


Ser 





















<210> 17 
<211> 200 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (200) 

<400> 17 



WO 00/29561 



PCT/DK00/00144 



7 



etc gag cag egg caa gga gat ttt ccg ccc egg egg egg cga cat gcg 4 8 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 

15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 

Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 

20 25 30 

gee cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 144 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 

35 40 45 

cga gaa gcg cgc cgt ggg cat egg cgc-^tat gtt cct egg ctt cct ggg 192 

Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 

50 55 60 

cgc tgc ag 200 
Arg Cys 
65 



<210> 18 
<211> 66 
<212> PRT 

<213> Artificial Sequence 
<400> 18 



Leu 


Glu 


Gin 


Arg 


Gin Gly Asp 


Phe 


Pro 


Pro 


Arg 


Arg 


Arg 


Arg 


His Ala 


1 








5 






10 










15 


Arg 


Gin 


Leu 


Ala 


Gin Arg Ala 


Val 


Gin 


Val 


Gin 


Gly 


Gly 


Glu 


Asp Arg 








20 






25 










30 




Ala 


Pro 


Gly 


His 


Arg Pro His 


Gin 


Gly 


Gin 


Ala 


Pro 


Arg 


Gly 


Ala Ala 






35 






40 










45 






Arg 


Glu 


Ala 


Arg 


Arg Gly His 


Arg 


Arg 


Tyr 


Val 


Pro 


Arg 


Leu 


Pro Gly 



50 55 60 

Arg Cys 
65 



<210> 19 
<211> 212 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (212) 

<400> 19 

etc gag cag egg caa gga gat ttt ccg ccc egg egg egg cga cat gcg 4 8 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 



gee cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 
Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 



144 



WO 00/29561 



8 



PCT/DK00/00144. 



cga gaa gcg cgc eta ggg cat egg cgc tat gtt cct egg ctt cct ggg 

Arg Glu Ala Arg Leu Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 

50 55 60 

cgc tgc age ccg ggg gat cc 
Arg Cys Ser Pro Gly Asp 
65 70 



<210> 20 
<211> 70 
<212> PRT 

<213> Artificial Sequence 
<400> 20 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 

1 5 10 15 

Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 

20 25 30 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 

35 40 45 

Arg Glu Ala Arg Leu Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 

50 55 60 

Arg Cys Ser Pro Gly Asp 
65 70 

<210> 21 
<211> 200 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (200) 

<400> 21 

etc gag cag egg caa gga gat ttt ccg ccc egg egg egg cga cat gcg 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 

15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 



cgc tgc ag 
Arg Cys 
65 



48 



96 



gee cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 14 4 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 

35 40 45 

cga gaa gag cgc cgt ggg cat egg cgc tat gtt cct egg ctt cct ggg 192 

Arg Glu Glu Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 

50 55 60 



200 



<210> 22 



WO 00/29561 



9 



PCT/DK00/00144 



<211> 66 
<212> PRT 

<213> Artificial Sequence 





<400> 


22 






Leu 


Glu 


Gin 


Arg 


Gin 


Gly Asp Phe 


1 








5 




Arg 


Gin 


Leu 


Ala 


Gin 


Arg Ala Val 








20 






Ala 


Pro 


Gly 


His 


Arg 


Pro His Gin 






35 






40 


Arq 


Glu 


Glu 


Arg 


Arg 


Gly His Arg 



50 55 
Arg Cys 
65 



<210> 23 
<211> 178 
<212> DNA 

<213> Artificial Sequence 



Pro Pro Arg Arg Arg Arg His Ala 

10 15 
Gin Val Gin Gly Gly Glu Asp Arg 
25 30 
Gly Gin Ala Pro Arg Gly Ala Ala 
45 

Arg Tyr Val Pro Arg Leu Pro Gly 
60 



<220> 

<221> CDS 

<222> (1) . . . (178) 

<400> 23 

ctg cag gca gca cca tgg gcg ccg cca gcc tga ccc tga ccg tgc agg 4 8 

Leu Gin Ala Ala Pro Trp Ala Pro Pro Ala * Pro * Pro Cys Arg 
15 10 



ccc gcc age tgc tga gcg gca teg tgc age age aga aca acc tgc tgc 96 
Pro Ala Ser Cys * Ala Ala Ser Cys Ser Ser Arg Thr Thr Cys Cys 
15 20 25 



gcg cca teg agg ccc age age acc tgc tec age tga ccg tgt ggg gca 144 
Ala Pro Ser Arg Pro Ser Ser Thr Cys Ser Ser * Pro Cys Gly Ala 
30 35 40 



tea age age tec agg ccc gcg tgc tgg etc tag a 178 
Ser Ser Ser Ser Arg Pro Ala Cys Trp Leu * 
45 50 



<210> 24 
<211> 54 
<212> PRT 

<213> Artificial Sequence 





<400> 


24 
























Leu 


Gin Ala 


Ala 


Pro 


Trp 


Ala 


Pro 


Pro 


Ala 


Pro Pro 


Cys 


Arg 


Pro 


Ala 


1 






5 










10 








15 




Ser 


Cys Ala 


Ala 
20 


Ser 


Cys 


Ser 


Ser 


Arg 
25 


Thr 


Thr Cys 


Cys 


Ala 
30 


Pro 


Ser 


Arg 


Pro Ser 
35 


Ser 


Thr 


Cys 


Ser 


Ser 
40 


Pro 


Cys 


Gly Ala 


Ser 
45 


Ser 


Ser 


Ser 


Arg 


Pro Ala 
50 


Cys 


Trp 


Leu 





















<210> 25 



WO 00/29561 



10 



PCT/DK00/00144 



<211> 178 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)...(178) 

<400> 25 

ctg cag gca gca cca tgg gcg ccg cca 

Leu Gin Ala Ala Pro Trp Ala Pro Pro 

1 5 

ccc gcc age tgc tga gcg gca teg tgc 
Pro Ala Ser Cys * Ala Ala Ser Cys 
15 20 

gcg cca teg agg ccc age age acc tgc 
Ala Pro Ser Arg Pro Ser Ser Thr Cys 
30 35 

tea age agt get gcg gcc gcg tgc tgg 
Ser Ser Ser Ala Ala Ala Ala Cys Trp 
45 50 



gcc tga ccc tga ccg tgc agg 48 
Ala * Pro * Pro Cys Arg 
10 

age age aga aca acc tgc tgc 96 
Ser Ser Arg Thr Thr Cys Cys 
25 

tec age tga ccg tgt ggg gca 14 4 

Ser Ser * Pro Cys Gly Ala 
40 

etc tag a 178 
Leu * 



<210> 26 
<211> 54 
<212> PRT 

<213> Artificial Sequence 
<400> 26 

Leu Gin Ala Ala Pro Trp Ala Pro Pro Ala Pro Pro Cys Arg Pro Ala 

1 5 10 15 

Ser Cys Ala Ala Ser Cys Ser Ser Arg Thr Thr Cys Cys Ala Pro Ser 

20 25 30 

Arg Pro Ser Ser Thr Cys Ser Ser Pro Cys Gly Ala Ser Ser Ser Ala 

35 40 45 

Ala Ala Ala Cys Trp Leu 
50 

<210> 27 
<211> 77 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (77) 

<400> 27 

tct aga gcg eta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 48 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 

1 5 10 15 

etc egg caa get gat ctg cac cac ggc eg 77 
Leu Arg Gin Ala Asp Leu His His Gly 
20 25 



WO 00/29561 



11 



PCT/DKOO/00144 



<210> 28 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<400> 28 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 

15 10 15 

Leu Arg Gin Ala Asp Leu His His Gly 
20 25 

<210> 29 
<211> 190 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (190) 

<400> 29 

egg ccg tgc cct gga acg cca get gga gca aca aga acc tga gec aga 

Arg Pro Cys Pro Gly Thr Pro Ala Gly Ala Thr Arg Thr * Ala Arg 

1 5 10 15 

ttt ggg aca aca tga cct gga tgg agt ggg age gcg aga tea gca act 
Phe Gly Thr Thr * Pro Gly Trp Ser Gly Ser Ala Arg Ser Ala Thr 

20 25 30 

aca ccg aga tea tct aca gec tga teg agg aga gec aga acc age agg 
Thr Pro Arg Ser Ser Thr Ala * Ser Arg Arg Ala Arg Thr Ser Arg 

35 40 45 

aga aga acg age tgg acc tgc tec age tgg aca agt ggg caa get t 
Arq Arq Thr Ser Trp Thr Cys Ser Ser Trp Thr Ser Gly Gin Ala 

50 55 60 



48 



96 



144 



190 



<210> 30 
<211> 60 
<212> PRT 

<213> Artificial Sequence 
<400> 30 

Arg Pro Cys Pro Gly Thr Pro Ala Gly Ala Thr Arg Thr Ala Arg Phe 

15 10 15 

Gly Thr Thr Pro Gly Trp Ser Gly Ser Ala Arg Ser Ala Thr Thr Pro 

20 25 30 

Arg Ser Ser Thr Ala Ser Arg Arg Ala Arg Thr Ser Arg Arg Arg Thr 

35 40 45 

Ser Trp Thr Cys Ser Ser Trp Thr Ser Gly Gin Ala 
50 55 60 

<210> 31 

<211> 177 

<212> DNA 

<213> Artificial Sequence 



WO 00/29561 



12 



PCT/DKOO/00144 



<220> 

<221> CDS 

<222> (1) . . . (177) 

<400> 31 

aag ctt gtg gaa ctg gtt caa cat cac caa ctg get gtg gta cat caa 4 8 

Lys Leu Val Glu Leu Val Gin His His Gin Leu Ala Val Val His Gin 
15 10 15 

gat ttt cat cat gat cgt ggg egg cct gat egg cct gcg cat cgt gtt 96 
Asp Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val 
20 25 30 



cac cgt get gag cat cgt gaa ccg cgt gcg cca ggg eta cag ccc cct 
His Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Leu Gin Pro Pro 
35 40 45 



144 



gag ctt cca gac ccg cct gee cgt gee ccg egg 177 
Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
50 55 



<210> 32 
<211> 59 
<212> PRT 

<213> Artificial Sequence 
<400> 32 

Lys Leu Val Glu Leu Val Gin His His Gin Leu Ala Val Val His Gin 

15 10 15 

Asp Phe His His Asp Arg Gly Arg Pro Asp Arg Pro Ala His Arg Val 

20 25 30 

His Arg Ala Glu His Arg Glu Pro Arg Ala Pro Gly Leu Gin Pro Pro 

35 40 45 

Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
50 55 

<210> 33 
<211> 140 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (140) 

<400> 33 

ccg egg ccc cga ccg ccc cga ggg cat cga gga gga ggg egg cga gcg 4 8 

Pro Arg Pro Arg Pro Pro Arg Gly His Arg Gly Gly Gly Arg Arg Ala 

15 10 15 

cga ccg cga ccg cag cac ccg cct ggt gac egg ctt cct gec cct gat 96 
Arg Pro Arg Pro Gin His Pro Pro Gly Asp Arg Leu Pro Ala Pro Asp 
20 25 30 

ctg gga cga cct gcg cag cct gtt cct gtt cag eta cca teg at 140 
Leu Gly Arg Pro Ala Gin Pro Val Pro Val Gin Leu Pro Ser 
35 40 45 



WO 00/29561 
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<210> 


34 


















<211> 


46 


















<212> 


PRT 


















<213> 


Artificial Sequence 
















<400> 


34 
















Pro 


Arg Pro 


Arg Pro 


Pro Arg Gly His Arg Gly Gly Gly Arg Arg 


Ala 


1 


5 




10 








15 




Arg 


Pro Arg 


Pro Gin 


His Pro Pro 


Gly Asp Arg 


Leu 


Pro 


Ala 


Pro 


Asp 


20 




25 






30 






Leu 


Gly Arg 


Pro Ala 


Gin Pro Val 


Pro Val Gin 


Leu 


Pro 


Ser 








35 




40 






45 










<210> 


35 


















<211> 


129 


















<212> 


DNA 


















<213> 


Artificial Sequence 
















<220> 




















<221> 


CDS 


















<222> 


(1) . (129) 
















<400> 


35 
















ate 


gat tgc 


gcg acc 


tgc tgc tga 


teg tgg ccc 


gca 


teg 


tgg 


age 


tgc 


He 


Asp Cys 


Ala Thr 


Cys Cys * 


Ser Trp Pro 


Ala 


Ser 


Trp 


Ser 


Cys 


1 




5 




10 










15 


tgg 


gec ggc 


gcg get 


ggg aga tec 


tga agt act 


ggt 


gga 


acc 


tgc 


tec 


Trp 


Ala Gly 


Ala Ala 


Gly Arg Ser 


+ Ser Thr 


Gly Gly 


Thr 


Cys 


Ser 






20 




25 










30 



48 



96 



agt act gga gee agg age tga aga act ctg cag 129 
Ser Thr Gly Ala Arg Ser * Arg Thr Leu Gin 

35 40 



<210> 36 
<211> 40 
<212> PRT 

<213> Artificial Sequence 

<400> 36 
He Asp Cys Ala Thr Cys Cys Ser 

1 5 
Ala Gly Ala Ala Gly Arg Ser Ser 
20 

Gly Ala Arg Ser Arg Thr Leu Gin 
35 40 

<210> 37 

<211> 114 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (114) 



Trp Pro Ala Ser Trp Ser Cys Trp 

10 15 

Thr Gly Gly Thr Cys Ser Ser Thr 
25 30 



WO 00/29561 
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<400> 37 

ctg cag tga gcc tgc tga acg cca ccg cca teg ccg tgg ccg agg gca 48 

Leu Gin * Ala Cys * Thr Pro Pro Pro Ser Pro Trp Pro Arg Ala 

1 5 10 

ccg acc gcg tga teg agg tgg tgc age gca tct ggc gcg gca tec tgc 96 

Pro Thr Ala * Ser Arg Trp Cys Ser Ala Ser Gly Ala Ala Ser Cys 

15 20 25 

aca tec cca ccc gaa ttc 114 

Thr Ser Pro Pro Glu Phe 

30 35 



<210> 38 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<400> 38 

Leu Gin Ala Cys Thr Pro Pro Pro Ser Pro Trp Pro Arg Ala Pro Thr 

15 10 15 

Ala Ser Arg Trp Cys Ser Ala Ser Gly Ala Ala Ser Cys Thr Ser Pro 
20 25 30 

Pro Glu Phe 
35 





<210> 


39 






<211> 


41 






<212> 


DNA 






<213> 


Artificial Sequence 






<220> 








<221> 


CDS 






<222> 


(1) . . - (41) 






<400> 


39 




gaa 


ttc gcc 


agg get teg age gcg 


ccc tgc 


Glu 


Phe Ala 


Arg Ala Ser Ser Ala 


Pro Cys 


1 




5 


10 



41 



<210> 40 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<400> 40 

Glu Phe Ala Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp 
15 10 

<210> 41 
<211> 506 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (506) 



WO 00/29561 
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PCT/DK00/00I44 



<400> 41 

get age gcg gee gac cgc ctg tgg gtg ace gtg tac tac ggc gtg ccc 48 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
15 10 15 

gtg tgg aag gac gec acc acc acc ctg ttc tgc gec age gac gec aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gec tac gac acc gag gtg cac aac gtg tgg gec acc cac gcg tgc gtg 144 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 288 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 336 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 110 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gec ctg 432 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 
Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

cgc ctg cgc age tgc aac aca teg at 506 
Arg Leu Arg Ser Cys Asn Thr Ser 
165 



<210> 42 
<211> 168 
<212> PRT 

<213> Artificial Sequence 
<400> 42 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

15 10 15 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 



WO 00/29561 
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Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 

50 55 60 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

lie He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 

100 105 110 

Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 

115 120 125 

Asn He Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 

130 135 140 

f*re Tyr Ser Leu Asp He Val Pro lie-: Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

Arg Leu Arg Ser Cys Asn Thr Ser 
165 

<210> 43 
<211> 374 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (374) 

<400> 43 

tct aga acc aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg 48 

Ser Arg Thr Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu 
15 10 15 

cgc gag aag ttc aac aac acc acc ate gtg ttc aac cag age tec ggc 96 
Arg Glu Lys Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly 
20 25 30 

ggc gac ccc gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc 144 
Gly Asp Pro Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe 
35 40 45 

ttc tac tgc aac acc acc cag ctg ttc aac age acc tgg aac gag acc 192 
Phe Tyr Cys Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr 
50 55 60 

aac age gag ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc 240 
Asn Ser Glu Gly Asn He Thr Ser Gly Thr He Thr Leu Pro Cys Arg 
65 70 75 80 

ate aag cag ate ate aac atg tgg cag gag gtg ggc aag gcc atg tac 288 
He Lys Gin He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr 

85 90 95 

gcc ccc ccc ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc 336 
Ala Pro Pro He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly 
100 105 HO 

ctg ctg ctg acc cgc gac ggc ggc age gac aac teg ag 374 
Leu Leu Leu Thr Arg Asp Gly Gly Ser Asp Asn Ser 
115 120 



WO 00/29561 
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<210> 44 
<211> 124 
<212> PRT 
<213> Artificial 



Sequence 



<400> 44 



Ser 


Arg 


Thr 


Asn 


Trp 


Thr 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ala 


Glu 


Lys 


Leu 


1 






5 










10 










15 




Arg 


Glu 


Lys 


Phe 


Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asn 


Gin 


Ser 


Ser 


Gly 




20 










25 










30 






Glv 


Asp 


Pro 


Glu 


He 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly Gly Glu 


Phe 




35 










40 










45 






Thr 


Phe 


Tyr 


Cys 


Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


Asn 


Glu 




50 








55 










60 










Asn 


Ser 


Glu 


Gly 


Asn 


lie 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


65 








70 










75 










80 


He 


Lys 


Gin 


He 


He 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly 


Lys 


Ala 


Met 


Tyr 








85 










90 










95 




Ala 


Pro 


Pro 


He 


Gly Gly 


Gin 


He 


Lys 


Cys 


Leu 


Ser 


Asn 


He 


Thr 


Gly 








100 










105 










110 






Leu 


Leu 


Leu 


Thr 


Arg 


Asp 


Gly 


Gly 


Ser 


Asp 


Asn 


Ser 











115 120 



<210> 45 
<211> 1277 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . - (1277) 

<400> 45 

get age gcg gec gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

15 10 15 

gtg tgg aag gac gec acc acc acc ctg ttc tgc gec age gac gec aag 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 



ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 

65 . 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 



48 



96 



gec tac gac acc gag gtg cac aac gtg tgg gec acc cac gcg tgc gtg 144 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 



192 



240 



288 



336 
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Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 HO 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 



cgc ctg cgc age tgc aac aca teg ate ate acc cag gee tgc ccc aag 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 175 



384 



aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gec ctg 432 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 
•Phe Tyr Ser Leu Asp lie Val Pro Il-e— Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 



528 



gtg age ttc gag ccc ate ccc ate cac ttc tgc gee ccc gee ggc ttc 576 

Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 

gec ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 

Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672 

Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 
210 215 220 

age acc cag ctg ctg ctg aac ggc age ctg gee gag gag gag gtg gtg 720 

Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 

225 230 235 240 

ate aga tct gag aac ttc acc aac aac gee aag acc ate ate gtg cag 768 

lie Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr lie lie Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 816 

Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 

cgc aag age ate cac ate ggc cct ggc cgc gec ttc tac acc acc ggc 864 

Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

gac ate ate ggc gac ate cgc cag gec cac tgc aac ate tct aga acc 912 

Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 
290 295 300 

aac tgg acc aac acc ctg 1 aag cgc gtg gee gag aag ctg cgc gag aag 960 

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 

305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 1008 

Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 



WO 00/29561 



19 



PCT/DK00/00144 



gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 

Glu lie Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 350 

aac acc acc cag ctg ttc aac age acc tgg aac gag acc aac age gag 

Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 

355 360 365 



1056 



1104 



ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc ate aag cag 1152 

Gly Asn lie Thr Ser Gly Thr He Thr Leu Pro Cys Arg He Lys Gin 

370 375 380 

artrc ate aac atg tgg cag gag gtg ggc— aag gee atg tac gee ccc ccc 1200 

He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc ctg ctg ctg 1248 

He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 
405 410 415 . 

acc cgc gac ggc ggc age gac aac teg ag 1277 
Thr Arg Asp Gly Gly Ser Asp Asn Ser 
420 425 



<210> 46 

<211> 425 

<212> PRT 

<213> Artificial Sequence 



<400> 46 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 




20 










25 










30 






Ala 


Tyr 


Asp 
35 


Thr 


Glu 


Val 


His 


Asn 
40 


Val 


Trp 


Ala 


Thr 


His 
45 


Ala 


Cys 


Val 


Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly Asn 


Val 


Thr 


Glu 




50 








55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 


lie 


Ser 


Leu 


Trp 
85 


Asp 


Gin 


Ser 


Leu 


Lys 
90 


Pro 


Cys 


Val 


Lys 


Leu 
95 


Thr 


Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser 


Thr 


Asp 






100 










105 










110 






Thr 


Asn 


Asn 
115 


Thr 


Arg 


Trp 


Gly 


Thr 
120 


Gin 


Glu 


Met 


Lys 


Asn 
125 


Cys 


Ser 


Phe 


Asn 


He 
130 


Ser 


Thr 


Ser 


Val 


Arg 
135 


Asn 


Lys 


Met 


Lys 


Arg 
140 


Glu 


Tyr 


Ala 


Leu 


Phe 


Tyr 


Ser 


Leu 


Asp 


He 


Val 


Pro 


He 


Asp 


Asn 


Asp 


Asn 


Thr 


Ser 


Tyr 


145 










150 










155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 
165 


Asn 


Thr 


Ser 


He 


He 
170 


Thr 


Gin 


Ala 


Cys 


Pro 
175 


Lys 


Val 


Ser 


Phe 


Glu 
180 


Pro 


He 


Pro 


He 


His 
185 


Phe 


Cys 


Ala 


Pro 


Ala 
190 


Gly 


Phe 


Ala 


He 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thr 


Phe 


Asn 


Gly 


Thr 


Gly 


Pro 


Cys 






195 








200 










205 








Thr 


Asn 
210 


Val 


Ser 


Thr 


Val 


Gin 
215 


Cys 


Thr 


His 


Gly 


He 
220 


Arg 


Pro 


Val 


Val 
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Ser 


Thr 


Gin 


Leu 


Leu 


Leu 


Asn 


Gly 


Ser 


Leu 


Ala 


Glu 


Glu 


Glu 


Val 


Val 


225 










230 










235 










240 


lie 


Arq 


Ser 


Glu 


Asn 


Phe 


Thr 


Asn 


Asn 


Ala 


Lys 


Thr 


He 


He 


Val 


Gin 








245 










250 










255 




Leu 


Asn 


Glu 


Ser 


Val 


Glu 


He 


Asn 


Cys 


Thr 


Arg 


Pro 


Asn 


Asn 


Asn 


Thr 








260 










265 










270 






Arq 


Lys 


Ser 


He 


His 


He 


Gly 


Pro 


Gly Arg 


Ala 


Phe 


Tyr 


Thr 


Thr 


Gly 






275 










280 










285 








Asp 


He 


He 


Gly 


Asp 


He 


Arq 


Gin 


Ala 


His 


Cys 


Asn 


He 


Ser 


Arg 


Thr 




290 










295 










300 










Asn 


Trp 


Thr 


Asn 


Thr 


Leu 


Lys 


Arq 


Val 


Ala 


Glu 


Lys 


Leu 


Arg 


Glu 


Lys 


305 










310 










315 










320 




Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asrr-Gln 


Ser 


Ser 


Gly 


Gly 


Asp 


Pro 










325 










330 










335 




Glu 


He 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly 


Gly 


Glu 


Phe 


Phe 


Tyr 


Cys 








340 










345 










350 






Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp 


Asn 


Glu 


Thr 


Asn 


Ser 


Glu 






355 










360 










365 








Gly 


Asn 


He 


Thr 


Ser 


Gly 


Thr 


He 


Thr 


Leu 


Pro 


Cys 


Arg 


He 


Lys 


Gin 




370 










375 










380 










He 


He 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly 


Lys 


Ala 


Met 


Tyr 


Ala 


Pro 


Pro 


385 










390 










395 










400 


He 


Gly 


Gly 


Gin 


He 


Lys 


Cys 


Leu 


Ser 


Asn 


He 


Thr 


Gly 


Leu 


Leu 


Leu 










405 










410 










415 




Thr 


Arg 


Asp 


Gly 


Gly Ser Asp Asn 


Ser 

















420 425 

<210> 47 
<211> 1277 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (1277) 

<400> 47 
get age gcg gec gac cgc 
Ala Ser Ala Ala Asp Arg 
1 5 

gtg tgg aag gac gec ace 
Val Trp Lys Asp Ala Thr 
20 

gee tac gac ace gag gtg 
Ala Tyr Asp Thr Glu Val 
35 

ccc acc gac ccc aac ccc 
Pro Thr Asp Pro Asn Pro 
50 

aac ttc aac atg ggc aag 
Asn Phe Asn Met Gly Lys 
65 70 



ctg tgg gtg 
Leu Trp Val 



acc acc ctg 
Thr Thr Leu 
25 

cac aac gtg 
His Asn val 
40 

cag gag gtg 
Gin Glu Val 
55 

aac aac atg 
Asn Asn Met 



acc gtg tac 
Thr Val Tyr 
10 

ttc tgc gec 
Phe Cys Ala 



tgg gec acc 
Trp Ala Thr 



gtg ctg ggc 
Val Leu Gly 
60 

gtg gag cag 
Val Glu Gin 
75 



tac ggc gtg 
Tyr Gly Val 
15 

age gac gee 
Ser Asp Ala 
30 

cac gcg tgc 
His Ala Cys 
45 

aac gtg acc 
Asn Val Thr 



atg cac gag 
Met His Glu 



ccc 4 8 

Pro 



aag 96 
Lys 



gtg 144 
Val 



gag 192 
Glu 



gat 240 
Asp 
80 



ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 288 
He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 



WO 00/29561 



21 



PCT/DKOO/00144 



85 90 95 

ccc ctg tgc gtg acc ctg caa tgc acc aag ctg aag cag age acc gac 336 
Pro Leu Cys Val Thr Leu Gin Cys Thr Lys Leu Lys Gin Ser Thr Asp 
100 105 110 

acc cag aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 

Thr Gin Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gee ctg 432 

Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 

130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 

Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate acc cag gee tgc ccc aag 528 

Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 

165 170 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gee ccc gee ggc ttc 576 

Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 

gee ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 

Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 672 

Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val 

210 215 220 

age acc cag ctg ctg ctg aac ggc age ctg gec gag gag gag gtg gtg 720 

Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

ate aga tct gag aac ttc acc aac aac gec aag acc ate ate gtg cag 768 

He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 816 

Leu Asn Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 

cgc aag age ate cac ate ggc cct ggc cgc gee ttc tac acc acc ggc 864 

Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

gac ate ate ggc gac ate cgc cag gec cac tgc aac ate tct aga acc 912 

Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 

290 295 300 

aac tgg acc aac acc ctg aag cgc gtg gee gag aag ctg cgc gag aag 960 

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 



ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 



1008 
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Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 

325 330 335 

gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056 

Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 

340 345 350 

aac acc acc cag ctg ttc aac age ace tgg aac gag acc aac age gag 1104 

Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 

355 360 365 

ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc ate aag cag 1152 

"G±y Asn He Thr Ser Gly Thr He Thr— Leu Pro Cys Arg He Lys Gin 

370 375 380 

ate ate aac atg tgg cag gag gtg ggc aag gee atg tac gec ccc ccc 1200 

He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 

385 390 395 400 

ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc ctg ctg ctg 1248 

He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 

405 410 415 

acc cgc gac ggc ggc age gac aac teg ag 1277 

Thr Arg Asp Gly Gly Ser Asp Asn Ser 

420 425 



<210> 48 
<211> 425 
<212> PRT 

<213> Artificial Sequence 



<400> 48 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 






Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 








Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly 


Asn 


Val 


Thr 


Glu 




50 










55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 










85 










90 










95 




Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Gin 


Cys 


Thr 


Lys 


Leu 


Lys 


Gin 


Ser 


Thr 


Asp 








100 










105 










110 






Thr 


Gin 


Asn 


Thr 


Arg 


Trp 


Gly 


Thr 


Gin 


Glu 


Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 










125 








Asn 


He 


Ser 


Thr 


Ser 


Val 


Arg 


Asn 


Lys 


Met 


Lys 


Arg 


Glu 


Tyr Ala 


Leu 




130 










135 










140 










Phe 


Tyr 


Ser 


Leu 


Asp 


He 


Val 


Pro 


lie 


Asp 


Asn 


Asp Asn 


Thr 


Ser 


Tyr 


145 










150 










155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Thr 


Ser 


lie 


He 


Thr 


Gin 


Ala 


Cys 


Pro 


Lys 










165 










170 










175 




Val 


Ser 


Phe 


Glu 


Pro 


He 


Pro 


He 


His 


Phe 


Cys 


Ala 


Pro 


Ala 


Gly 


Phe 








180 










185 










190 






Ala 


He 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thr 


Phe 


Asn 


Gly 


Thr 


Gly 


Pro 


Cys 
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195 










200 








205 








Thr 


Asn 
210 


Val 


Ser 


Thr 


Val 


Gin 
215 


Cys 


Thr 


His Gly 


He 
220 


Arg 


Pro 


Val 


Val 


Ser 


Thr 


Gin 


Leu 


Leu 


Leu 


Asn 


Gly 


Ser 


Leu Ala 


Glu 


Glu 


Glu 


Val 


Val 


225 










230 








235 










240 


He 


Arg 


Ser 


Glu 


Asn 
245 


Phe 


Thr 


Asn 


Asn 


Ala Lys 
250 


Thr 


He 


He 


Val 
255 


Gin 


Leu 


Asn 


Glu 


Ser 
260 


Val 


Glu 


He 


Asn 


Cys 
265 


Thr Arg 


Pro 


Asn 


Asn 
270 


Asn 


Thr 


Arg 


Lys 


Ser 


He 


His 


He 


Gly 


Pro 


Gly Arg Ala 


Phe 


Tyr 


Thr 


Thr 


Gly 






275 










280 








285 








Asp 


He 


He 


Gly 


Asp 


He 


Arg 


Gin 


Ala 


His Cys 


Asn 


He 


Ser Arg 


Thr 




290 










295 








300 










Asn 


Trp 


Thr 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ala Glu 


Lys 


Leu 


Arg 


Glu 


Lys 


305 










310 








315 










320 


Phe 


Asn 


Asn 


Thr 


Thr 


He 


Val 


Phe 


Asn 


Gin Ser 


Ser 


Gly 


Gly Asp 


Pro 










325 










330 








335 




Glu 


He 


Val 


Met 
340 


His 


Ser 


Phe 


Asn 


Cys 
345 


Gly Gly 


Glu 


Phe 


Phe 
350 


Tyr 


Cys 


Asn 


Thr 


Thr 
355 


Gin 


Leu 


Phe 


Asn 


Ser 
360 


Thr 


Trp Asn 


Glu 


Thr 
365 


Asn 


Ser 


Glu 


Gly 


Asn 
370 


He 


Thr 


Ser 


Gly 


Thr 
375 


He 


Thr 


Leu Pro 


Cys 
380 


Arg 


He 


Lys 


Gin 


He 


He 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly Lys Ala 


Met 


Tyr 


Ala 


Pro 


Pro 


385 










390 








395 










400 


He 


Gly 


Gly 


Gin 


He 
405 


Lys 


Cys 


Leu 


Ser 


Asn He 
410 


Thr 


Gly 


Leu 


Leu 
415 


Leu 


Thr 


Arg 


Asp 


Gly 
420 


Gly 


Ser 


Asp 


Asn 


Ser 
425 















<210> 49 
<211> 144 
<212> PRT 

<213> Artificial Sequence 

<400> 49 
get age gcg gec gac 
gtg tgg aag gac gec 
gec tac gac acc gag 
ccc acc gac ccc aac 
aac ttc aac atg ggc 
ate ate age ctg tgg 
ccc ctg tgc gtg acc 
acc cag aac acc cgc 
cag ate age acc age 
ttc tac age ctg gac 
cgc ctg cgc age tgc 
gtg age ttc gag ccc 
gee ate ctg aag tgc 
acc aac gtg age acc 
age acc cag ctg ctg 
ate aga tct gag aac 
ctg aac gag age gtg 
cgc aag age ate cac 
gac ate ate ggc gac 
aac tgg acc aac acc 
ttc aac aac acc acc 
gag ate gtg atg cac 
aac acc acc cag ctg 



cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48 

acc acc acc ctg ttc tgc gec age gac gec aag 96 

gtg cac aac gtg tgg gec acc cac gcg tgc gtg 144 

ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 

aag aac aac atg gtg gag cag atg cac gag gat 240 

gac cag age ctg aag ccc tgc gtg aag ctg acc 288 

ctg caa tgc acc aag ctg aag cag age acc gac 336 

tgg ggc acc cag gag atg aag aac tgc age ttc 384 

gtg cgc aac aag atg aag cgc gag tac gec ctg 432 

ate gtg ccc ate gac aac gac cag acc age tac 480 

aac aca teg ate ate acc cag gee tgc ccc aag 528 

ate ccc ate cac ttc tgc gec ccc gec ggc ttc 576 

aac aac aag acc ttc aac ggc acc ggc ccc tgc 624 

gtg cag tgc acc cac gga att cgc ccc gtg gtg 672 

ctg aac ggc age ctg gee gag gag gag gtg gtg 720 

ttc acc aac aac gec aag acc ate ate gtg cag 768 

gag ate aac tgc acc cgc ccc aac aac aac acc 816 

ate ggc cct ggc cgc gee ttc tac acc acc ggc 864 

ate cgc cag gec cac tgc aac ate tct aga acc 912 

ctg aag cgc gtg gec gag aag ctg cgc gag aag 960 

ate gtg ttc aac cag age tec ggc ggc gac ccc 1008 

age ttc aac tgc ggc ggc gag ttc ttc tac tgc 1056 

ttc aac age acc tgg aac gag acc aac age gag 1104 



a 
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ggc aac ate act agt ggc acc ate ace ctg ccc tgc cgc ate aag cag 

ate ate aac atg tgg cag gag gtg ggc aag gec atg tac gec ccc ccc 

ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc ctg ctg ctg 

acc cgc gac ggc ggc age gac aac teg ag 



1152 
1200 
1248 
1277 



<210> 50 
<211> 425 
<212> PRT 

<213> Artificial Sequence 
<400> 50 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

5 10' 15 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 

35 40 45 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 

50 55 60 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

Pro Leu Cys Val Thr Leu Gin Cys Thr Lys Leu Lys Gin Ser Thr Asp 

100 105 110 

Thr Gin Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 

115 120 125 

Gin He Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 

130 135 140 

Phe Tyr Ser Leu Asp He Val Pro He Asp Asn Asp Gin Thr Ser Tyr 
145 150 155 160 

Arg Leu Arg Ser Cys Asn Thr Ser He lie Thr Gin Ala Cys Pro Lys 

165 170 175 

Val Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly Phe 

180 185 190 

Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 

195 200 205 

Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val 

210 215 220 

Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

lie Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr lie lie Val Gin 

245 250 255 

Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr 

260 265 270 

Arg Lys Ser lie His lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 

275 280 285 

Asp lie lie Gly Asp He Arg Gin Ala His Cys Asn lie Ser Arg Thr 

290 295 300 

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 

325 330 335 

Glu lie Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 

340 345 350 

Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 

355 360 365 

Gly Asn He Thr Ser Gly Thr lie Thr Leu Pro Cys Arg lie Lys Gin 
370 375 380 
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lie lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

lie Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 

405 410 415 

Thr Arg Asp Gly Gly Ser Asp Asn Ser 
420 425 



<210> 51 
<211> 144 
<212> PRT 

<213> Artificial Sequence 





<400> 


51 




























get 


age 


gcg 


gec 


gac 


cgc 


ctg 


tgg 


gtg 


acc 


gtg 


tac 


tac 


ggc 


gtg 


ccc 


48 


gtg 


tgg 


aag 


gac 


gee 


acc 


acc 


acc 


ctg 


ttc 


tgc 


gec 


age 


gac 


gec 


aag 


96 


gec 


tClL. 




acc 


gag 


gtg 


cac 


aac 


gtg 


tgg 


gec 


acc 


cac 


gcg 


tgc 


gtg 


144 


ccc 


acc 


gac 


ccc 


aac 


ccc 


cag 


gag 


gtg 


gtg 






CI Q \ — . 


y 


acc 


nan 

yay 


192 


aac 


ttc 


aac 


atg 


ggc 


aag 


aac 


aac 


atg 


gtg 


gag 


cag 


atg 


cac 


gag 


gat 


240 


ate 


ate 


age 


ctg 


tgg 


gac 


cag 


age 


ctg 


aag 


ccc 


tgc 


gtg 


aag 


ctg 


acc 


288 


ccc 


ctg 


tgc 


gtg 


acc 


ctg 


aac 


tgc 


acc 


aag 


ctg 


aag 


aac 


age 


acc 


gac 


336 


acc 


aac 


aac 


acc 


cgc 


tgg 


ggc 


acc 


cag 


gag 


atg 


aag 


aac 


tgc 


age 


ttc 


384 


cag 


ate 


age 


acc 


age 


gtg 


cgc 


aac 


aag 


atg 


aag 


cgc 


gag 


tac 


gec 


ctg 


4 32 


ttc 


tac 


age 


ctg 


gac 


ate 


gtg 


ccc 


ate 


gac 


aac 


gac 


cag 


acc 


age 


tac 


480 


cgc 


ctg 


cgc 


age 


tgc 


aac 


aca 


teg 


ate 


ate 


acc 


cag 


gee 


tgc 


ccc 


aag 


528 


gtg 


age 


ttc 


gag 


ccc 


ate 


ccc 


ate 


cac 


ttc 


tgc 


gee 


ccc 


gee 


ggc 


ttc 


576 


gec 


ate 


ctg 


aag 


tgc 


aac 


aac 


aag 


acc 


ttc 


aac 


ggc 


acc 


ggc 


ccc 


tgc 


624 


acc 


aac 


gtg 


age 


acc 


gtg 


cag 


tgc 


acc 


cac 


gga 


att 


cgc 


ccc 


gtg 


gtg 


672 


age 


acc 


cag 


ctg 


ctg 


ctg 


aac 


ggc 


age 


ctg 


gec 


gag 


gag 


gag 


gtg 


gtg 


720 


ate 


aga 


tct 


gag 


aac 


ttc 


acc 


aac 


aac 


gec 


aag 


acc 


ate 


ate 


gtg 


cag 


768 


ctg 


aac 


gag 


age 


gtg 


gag 


ate 


aac 


tgc 


acc 


cgc 


ccc 


aac 


aac 


aac 


acc 


816 


cgc 


aag 


age 


ate 


cac 


ate 


ggc 


cct 


ggc 


cgc 


gec 


ttc 


tac 


acc 


acc 


ggc 


864 


gac 


ate 


ate 


ggc 


gac 


ate 


cgc 


cag 


gee 


cac 


tgc 


aac 


ate 


tct 


aga 


acc 


912 


aac 


tgg 


acc 


aac 


acc 


ctg 


aag 


cgc 


gtg 


gec 


gag 


aag 


ctg 


cgc 


gag 


aag 


960 


ttc 


aac 


aac 


acc 


acc 


ate 


gtg 


ttc 


aac 


cag 


age 


tec 


ggc 


ggc 


gac 


ccc 


1008 


gag 


ate 


gtg 


atg 


cac 


age 


ttc 


aac 


tgc 


ggc 


ggc 


gag 


ttc 


ttc 


tac 


tgc 


1056 


aac 


acc 


acc 


cag 


ctg 


ttc 


aac 


age 


acc 


tgg 


aac 


gag 


acc 


aac 


age 


gag 


1104 


ggc 


aac 


ate 


act 


agt 


ggc 


acc 


ate 


acc 


ctg 


ccc 


tgc 


cgc 


ate 


aag 


cag 


1152 


ate 


ate 


aac 


atg 


tgg 


cag 


gag 


gtg 


ggc 


aag 


gee 


atg 


tac 


gee 


ccc 


ccc 


1200 


ate 


ggc 


ggc 


cag 


ate 


aag 


tgc 


ctg 


age 


aac 


ate 


acc 


ggc 


ctg 


czg 


ctg 


1248 


acc 


cgc 


gac 


ggc 


ggc 


age 


gac 


aac 


teg 


ag 














1277 




<210> 


52 






























<211> 


425 






























<212> 


PRT 






























<213> 


Artificial Sequence 





















<400> 52 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 








20 










25 










30 






Ala 


Tyr Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 








Pro 


Thr 


Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly Asn 


Val 


Thr 


Glu 




50 










55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 








70 










75 










80 


He 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 



85 90 95 
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Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys Leu 


Lys 


Asn 


Ser 


Thr 


Asp 








100 










105 








110 






I nr 


Asn 


Asn 


Thr 


Arg 


Trp Gly Thr 


Gin 


Glu Met 


Lys 


Asn 


Cys 


Ser 


Phe 






115 










120 








125 








Gin 


He 


Ser 


Thr 


Ser 


Val 


Arg Asn 


Lys 


Met Lys 


Arg 


Glu 


Tyr 


Ala 


Leu 




130 










IOC 

135 








140 










Pne 


Tyr 


Ser 


Leu Asp 


He 


17*1 1 

Val 


Pro 


He 


Asp Asn Asp Gin Thr 


Ser 


Tyr 


145 










150 








155 










160 


Arg 


Leu 


Arg 


Ser 


Cys 


Asn 


Thr 


Ser 


He 


He Thr 


Gin 


Ala 


Cys 


Pro 


Lys 










165 










170 








175 




val 


Ser 


Pne 


Glu 


Pro 


He 


Pro 


lie 


His 


Phe Cys Ala 


Pro 


Ala 


Gly 


Phe 








180 










185 








190 








He 


Leu 


Lys 


Cys 


Asn 


Asn 


Lys 


Thnr-Phe Asn 


Gly Thr 


Gly 


Pro 


Cys 






195 










200 








205 








Thr 


Asn 


Val 


Ser 


Thr 


Val 


Gin 


Cys 


Thr 


His Gly 


lie Arg 


Pro 


Val 


Val 




210 










215 








220 










Ser 


Tnr 


Gin 


Leu 


Leu 


Leu 


Asn 


Gly 


Ser 


Leu Ala 


Glu 


Glu 


Glu 


Val 


Val 


ZZ5 










230 








235 










240 


lie 


Arg 


Ser 


Glu 


Asn 


Phe 


Thr 


Asn 


Asn 


Ala Lys 


Thr 


He 


He 


Val 


Gin 










245 










250 








255 




Leu 


Asn 


Glu 


Ser 


Val 


Glu 


He 


Asn 


Cys 


Thr Arg 


Pro 


Asn 


Asn 


Asn 


Thr 








260 










265 








270 






Arg 


Lys 


Ser 


He 


His 


He 


Gly 


Pro Gly Arg Ala 


Phe 


Tyr 


Thr 


Thr 


Gly 






2/5 










280 








285 








Asp 


He 


He 


Gly Asp 


He 


Arg 


Gin 


Ala 


His Cys 


Asn 


He 


Ser 


Arg 


Thr 




O Q O 

290 










295 








300 










Asn 


Trp 


Tnr 


Asn 


Thr 


Leu 


Lys 


Arg 


Val 


Ala Glu 


Lys 


Leu 


Arg 


Glu 


Lys 


■JAC 

JU5 










310 








315 










320 


Pne 


Asn 


Asn 


Thr 


Thr 


lie 


Val 


Phe 


Asn 


Gin Ser 


Ser 


Gly Gly Asp 


Pro 










325 










330 








335 




Glu 


lie 


Val 


Met 


His 


Ser 


Phe 


Asn 


Cys 


Gly Gly Glu 


Phe 


Phe 


Tyr 


Cys 








340 










345 








350 






Asn 


Thr 


Thr 


Gin 


Leu 


Phe 


Asn 


Ser 


Thr 


Trp Asn 


Glu 


Thr 


Asn 


Ser 


Glu 






355 










360 








365 








Gly 


Asn 


He 


Thr 


Ser 


Gly Thr 


lie 


Thr 


Leu Pro 


Cys Arg 


He 


Lys 


Gin 




370 










375 








380 










He 


He 


Asn 


Met 


Trp 


Gin 


Glu 


Val 


Gly 


Lys Ala 


Met 


Tyr 


Ala 


Pro 


Pro 


385 










390 








395 










400 


He 


Gly 


Gly 


Gin 


He 


Lys 


Cys 


Leu 


Ser 


Asn He 


Thr 


Gly 


Leu 


Leu 


Leu 










405 










410 








415 




Thr 


Arg 


Asp 


Gly Gly 


Ser 


Asp 


Asn 


Ser 















420 425 

<210> 53 
<211> 432 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (432) 

<400> 53 

tct aga gcg eta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 48 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
15 10 15 

etc egg caa get gat ctg cac cac ggc cgt gec ctg gaa cgc cag ctg 96 
Leu Arg Gin Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gin Leu 
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20 25 30 

gag caa caa gaa cct gag cca gat ttg gga caa cat gac ctg gat gga 14 4 

Glu Gin Gin Glu Pro Glu Pro Asp Leu Gly Gin His Asp Leu Asp Gly 

35 40 45 

gtg gga gcg cga gat cag caa eta cac cga gat cat eta cag cct gat 192 

Val Gly Ala Arg Asp Gin Gin Leu His Arg Asp His Leu Gin Pro Asp 

50 55 60 

cga gga gag cca gaa cca gca gga gaa gaa cga get gga cct get cca 240 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 

~55 70 75 80 

get gga caa gtg ggc aag ctt gtg gaa ctg gtt caa cat cac caa ctg 288 

Ala Gly Gin Val Gly Lys Leu Val Glu Leu Val Gin His His Gin Leu 

85 90 95 

get gtg gta cat caa gat ttt cat cat gat cgt ggg egg cct gat egg 336 

Ala Val Val His Gin Asp Phe His His Asp Arg Gly Arg Pro Asp Arg 

100 105 110 

cct gcg cat cgt gtt cac cgt get gag cat cgt gaa ccg cgt gcg cca 384 

Pro Ala His Arg Val His Arg Ala Glu His Arg Glu Pro Arg Ala Pro 

115 120 125 

ggg eta cag ccc cct gag ctt cca gac ccg cct gec cgt gec ccg egg 432 

Gly Leu Gin Pro Pro Glu Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 

130 135 140 



<210> 54 
<211> 144 
<212> PRT 

<213> Artificial Sequence 
<400> 54 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 

15 10 15 

Leu Arg Gin Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gin Leu 

20 25 30 

Glu Gin Gin Glu Pro Glu Pro Asp Leu Gly Gin His Asp Leu Asp Gly 

35 40 45 

Val Gly Ala Arg Asp Gin Gin Leu His Arg Asp His Leu Gin Pro Asp 

50 55 60 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

Ala Gly Gin Val Gly Lys Leu Val Glu Leu Val Gin His His Gin Leu 

85 90 95 

Ala Val Val His Gin Asp Phe His His Asp Arg Gly Arg Pro Asp Arg 

100 105 110 

Pro Ala His Arg Val His Arg Ala Glu His Arg Glu Pro Arg Ala Pro 

115 120 125 

Gly Met Gin Pro Pro Glu Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
130 135 140 



<210> 55 
<211> 434 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (434) 

<400> 55 

tct aga gcg eta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 4 8 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
15 10 15 

96 



144 



etc 


egg 


caa 


get 


gat 


ctg 


cac 


cac 


ggc 


cgt 


gec 


ctg 


gaa 




cag 


ctg 


•iie-u 


Arg 


Gin 


Ala 


Asp 


Leu 


His 


His 


Glry^Arg 


Ala 


Leu 


Glu 


Arg 


Gin 


Leu 








20 










C. D 










30 






gag 


caa 


caa 


gaa 


cct 


gag 


cca 


gat 


ttg 


gga 


caa 


cat 


gac 


+* 

ctg 


gat 


gga 


Glu 


Gin 


Gin 


Glu 


Pro 


Glu 


Pro 


Asp 


T ah 


Gly Gin 


His 


Asp 


Leu 


Asp 


Gly 






35 










40 










45 








gtg 


gga 


gcg 


cga 


gat 


cag 


caa 


eta 


cac 


cga 


gat 


cat 


eta 


cag 


cct 


gat 


Val 


Gly Ala 


Arg 


Asp 


Gin 


Gin 


Leu 


His 


Arg 


Asp 


His 


Leu 


Gin 


Pro 


Asp 




50 










55 










60 










cga 


gga 


gag 


cca 


gaa 


cca 


gca 


gga 


gaa 


gaa 


cga 


get 


gga 


cct 


get 


cca 


Arg 


Gly Glu 


Pro 


Glu 


Pro 


Ala 


Gly 




Glu 


Arg 


Ala 


Gly 


rro 


Ala 


Pro 


65 










70 










75 










80 


get 


gga 


caa 


gtg 


ggc 


aag 


ctt 


gtg 


gaa 


ctg 


gtt 


caa 


cat 


cac 


caa 


ctg 


Ala 


Gly 


Gin 


Val 


Gly Lys 


Leu 


Val 


Glu 


Leu 


Val 


Gin 


His 


His 


Gin 


Leu 










85 










90 










95 




get 


gtg 


gta 


cat 


caa 


gat 


ttt 


cat 


cat 


gat 


cgt 


ggg 


egg 


cct 


gat 


egg 


Ala 


Val 


Val 


His 


Gin Asp 


Phe 


His 


HIS 


Asp Arg 


Gly Arg 


Pro 


Asp Arg 








100 










1 AC 

100 










110 






cct 


gcg 


cat 


cgt 


gtt 


cac 


cgt 


get 


gag 


cat 


cgt 


gaa 


ccg 


A 4" 

cgt 


gcg 


cca 


Pro 


Ala 


His 


Arg 


Val 


His 


Arg 


Ala 


Glu 


His 


Arg 


Glu 


Pro 


Arg 


Ala 


Pro 






115 










120 










125 








ggg 


atg 


cag 


ccc 


cct 


gag 


ctt 


cca 


gac 


ccg 


cct 


gec 


cgt 


gtg 


acg 


gat 


Gly 


Met 


Gin 


Pro 


Pro 


Glu 


Leu 


Pro 


Asp 


Pro 


Pro 


Ala 


Arg 


Val 


Thr 


Asp 




130 










135 










140 










cc 


































<210> 


56 




























<211> 


144 




























<212> 


PRT 




























<213> 


Artificial Sequence 




















<400> 


56 


























Ser 


Arg 


Ala 


Leu 


Pro 


Pro 


Gly 


Pro 


Ala 


Leu 


Pro 


Gly 


His 


Val 


Gly 


Leu 


1 








5 










10 










15 




Leu 


Arg 


Gin 


Ala 


Asp 


Leu 


His 


His 


Gly 


Arg Ala 


Leu 


Glu 


Arg 


Gin 


Leu 








20 










25 










30 






Glu 


Gin 


Gin 


Glu 


Pro 


Glu 


Pro 


Asp 


Leu 


Gly 


Gin 


His 


Asp 


Leu 


Asp 


Gly 






35 










40 










45 








Val 


Gly 


Ala 


Arg 


Asp 


Gin 


Gin 


Leu 


His 


Arg 


Asp 


His 


Leu 


Gin 


Pro 


Asp 




50 










55 










60 











192 



240 



288 



336 



384 



432 



434 
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Arg 


Gly 


Glu 


Pro 


Glu 


Pro 


Ala 


Gly 


Glu 


Glu 


Arg 


Ala 


Gly 


Pro 


Ala Pro 


65 










70 










75 








80 


Ala 


Gly 


Gin 


Val 


Gly 
85 


Lys 


Leu 


Val 


Glu 


Leu 
90 


Val 


Gin 


His 


His 


Gin Leu 
95 


Ala 


Val 


Val 


His 
100 


Gin 


Asp 


Phe 


His 


His 
105 


Asp 


Arg 


Gly 


Arg 


Pro 
110 


Asp Arg 


Pro 


Ala 


His 


Arg 


Val 


His 


Arg 


Ala 


Glu 


His 


Arg 


Glu 


Pro Arg 


Ala Pro 






115 










120 










125 






Gly 


Met 
130 


Gin 


Pro 


Pro 


Glu 


Leu 
135 


Pro 


Asp 


Pro 


Pro 


Ala 
140 


Arg 


Val 


Thr Asp 



<210> 57 
<211> 281 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)...{281) 

<400> 57 

tct aga gcg eta cct cca gga cca gcg ctt cct ggg cat gtg ggg ctg 4 8 

Ser Arg Ala Leu Pro Pro Gly Pro Ala Leu Pro Gly His Val Gly Leu 
15 10 15 

etc egg caa get gat ctg cac cac ggc cgt gec ctg gaa cgc cag ctg 96 
Leu Arg Gin Ala Asp Leu His His Gly Arg Ala Leu Glu Arg Gin Leu 
20 25 30 

gag caa caa gaa cct gag cca gat ttg gga caa cat gac ctg gat gga 14 4 

Glu Gin Gin Glu Pro Glu Pro Asp Leu Gly Gin His Asp Leu- Asp Gly 
35 40 45 

gtg gga gcg cga gat cag caa eta cac cga gat cat eta cag cct gat 192 
Val Gly Ala Arg Asp Gin Gin Leu His Arg Asp His Leu Gin Pro Asp 
50 55 60 

cga gga gag cca gaa cca gca gga gaa gaa cga get gga cct get cca 24 0 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

get gga caa gtg ggc aag ctt gtg tga ctg att gag gat cc 281 
Ala Gly Gin Val Gly Lys Leu Val * Leu lie Glu Asp 

85 90 





<210> 


58 










<211> 


92 










<212> 


PRT 










<213> 


Artificial 


Sequence 




<400> 


58 








Ser 


Arg Ala 


Leu 


Pro 


Pro 


Gly Pro 


1 






5 






Leu 


Arg Gin 


Ala 


Asp 


Leu 


His His 






20 








Glu 


Gin Gin 


Glu 


Pro 


Glu 


Pro Asp 




35 








40 


Val 


Gly Ala 


Arg 


Asp 


Gin 


Gin Leu 



Ala 


Leu 


Pro 


Gly 


His 


Val 


Gly Leu 




10 










15 


Gly 


Arg 


Ala 


Leu 


Glu 


Arg 


Gin Leu 


25 










30 




Leu 


Gly 


Gin 


His 


Asp 


Leu 


Asp Gly 










45 






His 


Arg 


Asp 


His 


Leu 


Gin 


Pro Asp 
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50 55 60 

Arg Gly Glu Pro Glu Pro Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro 
65 70 75 80 

Ala Gly Gin Val Gly Lys Leu Val Leu lie Glu Asp 
85 90 

<210> 59 
<211> 272 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (272) 

<400> 59 

ate gat tgc gcg acc tgc tgc tga teg tgg ccc gca teg tgg age tgc 48 

lie Asp Cys Ala Thr Cys Cys * Ser Trp Pro Ala Ser Trp Ser Cys 
15 10 15 

tgg gee ggc gcg get ggg aga tec tga agt act ggt gga acc tgc tec 96 
Trp Ala Gly Ala Ala Gly Arg Ser * Ser Thr Gly Gly Thr Cys Ser 

20 25 30 

agt act gga gee agg age tga aga act ctg cag tga gec tgc tga acg 14 4 

Ser Thr Gly Ala Arg Ser * Arg Thr Leu Gin * Ala Cys * Thr 

35 40 

cca ccg cca teg ccg tgg ccg agg gca ccg acc gcg tga teg agg tgg 192 
Pro Pro Pro Ser Pro Trp Pro Arg Ala Pro Thr Ala * Ser Arg Trp 
45 50 55 

tgc age gca tct ggc gcg gca tec tgc aca tec cca ccc gaa ttc gee 240 
Cys Ser Ala Ser Gly Ala Ala Ser Cys Thr Ser Pro Pro Glu Phe Ala 
60 65 70 

agg get teg age gcg ccc tgc tgt aag gat cc 272 
Arg Ala Ser Ser Ala Pro Cys Cys Lys Asp 
75 80 



<210> 60 
<211> 84 
<212> PRT 

<213> Artificial Sequence 

<400> 60 
He Asp Cys Ala Thr Cys Cys Ser 

1 5 
Ala Gly Ala Ala Gly Arg Ser Ser 
20 

Gly Ala Arg Ser Arg Thr Leu Gin 

35 40 
Trp Pro Arg Ala Pro Thr Ala Ser 

50 55 
Ala Ser Cys Thr Ser Pro Pro Glu 
65 70 
Cys Cys Lys Asp 



Trp Pro Ala Ser Trp Ser Cys Trp 

10 15 
Thr Gly Gly Thr Cys Ser Ser Thr 
25 30 
Ala Cys Thr Pro Pro Pro Ser Pro 
45 

Arg Trp Cys Ser Ala Ser Gly Ala 
60 

Phe Ala Arg Ala Ser Ser Ala Pro 
75 80 
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<210> 61 
<211> 798 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (798) 

<400> 61 

etc gag cag egg caa gga gat ttt ccg ccc egg egg egg cga cat gcg 48 

Eeu Glu Gin Arg Gin Gly Asp Phe PrcTTro Arg Arg Arg Arg His Ala 
15 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 

gec cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 14 4 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gcg cgc cgt ggg cat egg cgc tat gtt cct egg ctt cct ggg 192 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc agg cag cac cat ggg cgc cgc cag cct gac cct gac cgt gca 240 
Arg Cys Arg Gin His His Gly Arg Arg Gin Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg cca get get gag egg cat cgt gca gca gca gaa caa cct get 288 
Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gin Pro Ala 

85 90 95 

gcg cgc cat cga ggc cca gca gca cct get cca get gac cgt gtg ggg 336 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

cat caa gca get cca ggc ccg cgt get ggc tct aga gcg eta cct cca 384 
His Gin Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 

gga cca gcg ctt cct ggg cat gtg ggg ctg etc egg caa get gat ctg 432 
Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gin Ala Asp Leu 
130 135 140 

cac cac ggc cgt gec ctg gaa cgc cag ctg gag caa caa gaa cct gag 480 
His His Gly Arg Ala Leu Glu Arg Gin Leu Glu Gin Gin Glu Pro Glu 
145 150 155 160 

cca gat ttg gga caa cat gac ctg gat gga gtg gga gcg cga gat cag 528 
Pro Asp Leu Gly Gin His Asp Leu Asp Gly Val Gly Ala Arg Asp Gin 
165 170 175 

caa eta cac cga gat cat eta cag cct gat cga gga gag cca gaa cca 57 6 

Gin Leu His Arg Asp His Leu Gin Pro Asp Arg Gly Glu Pro Glu Pro 
180 185 190 
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gca gga gaa 
Ala Gly Glu 
195 

ctt gtg gaa 
Leu Val Glu 
210 

ttt cat cat 
Phe His His 
225 

c"gt get gag 
Arg Ala Glu 



ctt cca gac 
Leu Pro Asp 



gaa cga get 
Glu Arg Ala 



ctg gtt caa 
Leu Val Gin 



gat cgt ggg 
Asp Arg Gly 
230 

cat cgt gaa 
His Arg Glu 
245 

ccg cct gec 
Pro Pro Ala 
260 



gga cct get 
Gly Pro Ala 
200 

cat cac caa 
His His Gin 
215 

egg cct gat 
Arg Pro Asp 



ccg cgt gcg 
Pro Arg Ala 



cgt gec ccg 
Arg Ala Pro 
265 



cca get gga 
Pro Ala Gly 



ctg get gtg 
Leu Ala Val 
220 

egg cct gcg 
Arg Pro Ala 
235 

cca ggg eta 
Pro Gly Leu 
250 

egg 
Arg 



caa gtg ggc 
Gin Val Gly 
205 

gta cat caa 
Val His Gin 



cat cgt gtt 
His Arg Val 



cag ccc cct 
Gin Pro Pro 
255 



aag 624 
Lys 



gat 672 
Asp 



cac 720 

His 

240 

gag 7 68 

Glu 



798 



<210> 62 
<211> 266 
<212> PRT 
<213> Arti 

<400> 62 
Leu Glu Gin Arg 
1 

Arg Gin Leu Ala 
20 

Ala Pro Gly His 
35 

Arg Glu Ala Arg 
50 

Arg Cys Arg Gin 
65 

Gly Pro Pro Ala 

Ala Arg His Arg 
100 

His Gin Ala Ala 
115 

Gly Pro Ala Leu 
130 

His His Gly Arg 
145 

Pro Asp Leu Gly 

Gin Leu His Arg 
180 

Ala Gly Glu Glu 
195 

Leu Val Glu Leu 
210 

Phe His His Asp 
225 

Arg Ala Glu His 



ficial Sequence 



Gin 


Gly 


Asp 


Phe 


5 








Gin 


Arg 


Ala 


Val 


Arg 


Pro 


His 


Gin 








40 


Arg 


Gly 


His 


Arg 






55 




His 


His 


Gly 


Arg 




70 






Ala 


Glu 


Arg 


His 


85 








Gly 


Pro 


Ala 


Ala 


Pro 


Gly 


Pro 


Arg 








120 


Pro 


Gly 


His 


Val 






135 




Ala 


Leu 


Glu 


Arg 




150 






Gin 


His 


Asp 


Leu 


165 








Asp 


His 


Leu 


Gin 


Arg 


Ala 


Gly 


Pro 








200 


Val 


Gin 


His 


His 






215 




Arg 


Gly 


Arg 


Pro 




230 






Arg 


Glu 


Pro 


Arg 



Pro 


Pro 


Arg 


Arg 




10 






Gin 


Val 


Gin 


Gly 


25 








Gly 


Gin 


Ala 


Pro 


Arg 


Tyr 


Val 


Pro 








60 


Arg 


Gin 


Pro 


Asp 






75 




Arg 


Ala 


Ala 


Ala 




90 






Pro 


Ala 


Pro 


Ala 


105 








Ala 


Gly 


Ser 


Arg 


Gly 


Leu 


Leu 


Arg 








140 


Gin 


Leu 


Glu 


Gin 






155 




Asp 


Gly 


Val 


Gly 




170 






Pro 


Asp 


Arg 


Gly 


185 








Ala 


Pro 


Ala 


Gly 


Gin 


Leu 


Ala 


Val 








220 


Asp 


Arg 


Pro 


Ala 






235 




Ala 


Pro 


Gly 


Leu 




250 







Arg 


Arg 


His 


Ala 






15 




Gly 


Glu 


Asp 


Arg 




30 






Arg 


Gly 


Ala 


Ala 


45 








Arg 


Leu 


Pro 


Gly 


Pro 


Asp 


Arg 


Ala 








80 


Glu 


Gin 


Pro 


Ala 






95 




Asp 


Arg 


Val 


Gly 




110 






Ala 


Leu 


Pro 


Pro 


125 








Gin 


Ala 


Asp 


Leu 


Gin 


Glu 


Pro 


Glu 








160 


Ala 


Arg 


Asp 


Gin 






175 




Glu 


Pro 


Glu 


Pro 




190 






Gin 


Val 


Gly 


Lys 


205 








Val 


His 


Gin 


Asp 


His 


Arg 


Val 


His 








240 


Gin 


Pro 


Pro 


Glu 






255 
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Leu Pro Asp Pro Pro Ala Arg Ala Pro Arg 
260 265 

<210> 63 
<211> 800 
<212> DNA 

<213> Artificial Sequence 





<220> 
<221> 
<222> 


CDS 
(1) 


. . . (800) 






















etc 
Leu 
1 


<400> 
gag cag 
Glu Gin 


63 

egg 

Arg 


caa gga 
Gin Gly 
5 


gat 
Asp 


ttt 
Phe 


ccg 
Pro 


ccc 
Pro 
10 


egg 
Arg 


egg 
Arg 


egg 
Arg 


cga 
Arg 


cat 
His 
15 


gcg 
Ala 



48 



cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 96 
Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 

gec cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 14 4 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gcg cgc cgt ggg cat egg cgc tat gtt cct egg ctt cct ggg 192 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc agg cag cac cat ggg cgc cgc cag cct gac cct gac cgt gca 240 
Arg Cys Arg Gin His His Gly Arg Arg Gin Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg cca get get gag egg cat cgt gca gca gca gaa caa cct get 288 
Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gin Pro Ala 

85 90 95 

gcg cgc cat cga ggc cca gca gca cct get cca get gac cgt gtg ggg 336 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 110 

cat caa gca get cca ggc ccg cgt get ggc tct aga gcg eta cct cca 384 
His Gin Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 

gga cca gcg ctt cct ggg cat gtg ggg ctg etc egg caa get gat ctg 432 
Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gin Ala Asp Leu 
130 135 140 

cac cac ggc cgt gee ctg gaa cgc cag ctg gag caa caa gaa cct gag 480 
His His Gly Arg Ala Leu Glu Arg Gin Leu Glu Gin Gin Glu Pro Glu 
145 150 155 160 

cca gat ttg gga caa cat gac ctg gat gga gtg gga gcg cga gat cag 528 
Pro Asp Leu Gly Gin His Asp Leu Asp Gly Val Gly Ala Arg Asp Gin 
165 170 175 



caa eta cac cga gat cat eta cag cct gat cga gga gag cca gaa cca 
Gin Leu His Arg Asp His Leu Gin Pro Asp Arg Gly Glu Pro Glu Pro 



576 
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gca gga gaa 
Ala Gly Glu 
195 

ctt gtg gaa 
Lea Val Glu 
210 

ttt cat cat 
Phe His His 

cgt get gag 
Arg Ala Glu 



ctt cca gac 
Leu Pro Asp 



180 

gaa cga get 
Glu Arg Ala 



ctg gtt caa 
Leu Val Gin 



gat cgt ggg 
Asp Arg Gly 
230 

cat cgt gaa 
His Arg Glu 
245 

ccg cct gec 
Pro Pro Ala 
260 



185 

gga cct get 
Gly Pro Ala 
200 

cat cac caa 
His His Gin 
215 

egg cct gat 
Arg Pro Asp 



ccg cgt gcg 
Pro Arg Ala 



cgt gtg acg 
Arg Val Thr 
265 



cca get gga 
Pro Ala Gly 



ctg get gtg 
Leu Ala Val 
220 

egg cct gcg 
Arg Pro Ala 
235 

cca ggg atg 
Pro Gly Met 
250 

gat cc 
Asp 



190 

caa gtg ggc 
Gin Val Gly 
205 

gta cat caa 
Val His Gin 



cat cgt gtt 
His Arg Val 



cag ccc cct 
Gin Pro Pro 
255 



aag 624 
Lys 



gat 672 
Asp 



cac 720 

His 

240 

gag 768 
Glu 



800 



<210> 64 
<211> 266 
<212> PRT 

<213> Artificial Sequence 



<400> 64 



Leu 


Glu 


Gin 


Arg 


Gin 


Gly 


Asp 


Phe 


1 








5 








Arg 


Gin 


Leu 


Ala 


Gin 


Arg 


Ala 


Val 








20 










Ala 


Pro 


Gly 


His 


Arg 


Pro 


His 


Gin 






35 










40 


Arg 


Glu 


Ala 


Arg 


Arg 


Gly 


His 


Arg 




50 










55 




Arg 


Cys 


Arg 


Gin 


His 


His 


Gly 


Arg 


65 










70 






Gly 


Pro 


Pro 


Ala 


Ala 


Glu 


Arg 


His 










85 








Ala 


Arg 


His 


Arg 


Gly 


Pro 


Ala 


Ala 








100 










His 


Gin 


Ala 


Ala 


Pro 


Gly 


Pro 


Arg 






115 










120 


Gly 


Pro 


Ala 


Leu 


Pro 


Gly 


His 


Val 




130 










135 




His 


His 


Gly 


Arg 


Ala 


Leu 


Glu 


Arg 


145 










150 






Pro 


Asp 


Leu 


Gly 


Gin 


His 


Asp 


Leu 










165 








Gin 


Leu 


His 


Arg 


Asp 


His 


Leu 


Gin 








180 










Ala 


Gly 


Glu 


Glu 


Arg 


Ala 


Gly 


Pro 






195 










200 


Leu 


Val 


Glu 


Leu 


Val 


Gin 


His 


His 




210 










215 




Phe 


His 


His 


Asp 


Arg 


Gly 


Arg 


Pro 


225 










230 







Pro 


Pro 


Arg Arg 


Arg Arg 


His 


Ala 




10 










15 




Gin 


Val 


Gin 


Gly 


Gly Glu Asp 


Arg 


25 










30 






Gly 


Gin 


Ala 


Pro 


Arg 


Gly Ala 


Ala 










45 








Arg 


Tyr 


Val 


Pro 


Arg 


Leu 


Pro 


Gly 








60 










Arg 


Gin 


Pro 


Asp 


Pro 


Asp 


Arg 


Ala 






75 










80 


Arg 


Ala 


Ala 


Ala 


Glu 


Gin 


Pro 


Ala 




90 










95 




Pro 


Ala 


Pro 


Ala 


Asp Arg 


Val 


Gly 


105 










110 






Ala 


Gly 


Ser 


Arg 


Ala 


Leu 


Pro 


Pro 










125 








Gly 


Leu 


Leu 


Arg 


Gin 


Ala 


Asp 


Leu 








140 










Gin 


Leu 


Glu 


Gin 


Gin 


Glu 


Pro 


Glu 






155 










160 


Asp 


Gly 


Val 


Gly 


Ala 


Arg 


Asp 


Gin 




170 










175 




Pro 


Asp 


Arg 


Gly 


Glu 


Pro 


Glu 


Pro 


185 










190 






Ala 


Pro 


Ala 


Gly 


Gin 


Val 


Gly 


Lys 










205 








Gin 


Leu 


Ala 


Val 


Val 


His 


Gin 


Asp 








220 










Asp 


Arg 


Pro 


Ala 


His 


Arg 


Val 


His 






235 










240 
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Arq Ala Glu His Arg Glu Pro Arg Ala Pro Gly Met Gin Pro Pro Glu 

245 250 255 

Leu Pro Asp Pro Pro Ala Arg Val Thr Asp 
260 265 

<210> 65 

<211> 64*7 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (647) 

<400> 65 

etc gag cag egg caa gga gat ttt ccg ccc egg egg egg cga cat gcg 
Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 
3/5 10 15 

cga caa ctg gcg cag cga get gta caa gta caa ggt ggt gaa gat cga 
Arq Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 
20 25 30 

gec cct ggg cat cgc ccc cac caa ggc caa gcg ccg cgt ggt gca gcg 
Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 
35 40 45 

cga gaa gcg cgc cgt ggg cat egg cgc tat gtt cct egg ctt cct ggg 
Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 
50 55 60 

cgc tgc agg cag cac cat ggg cgc cgc cag cct gac cct gac cgt gca 
Arg Cys Arg Gin His His Gly Arg Arg Gin Pro Asp Pro Asp Arg Ala 
65 70 75 80 

ggc ccg cca get get gag egg cat cgt gca gca gca gaa caa cct get 
Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gin Pro Ala 

85 90 95 

gcg cgc cat cga ggc cca gca gca cct get cca get gac cgt gtg ggg 
Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 
100 105 HO 

cat caa gca get cca ggc ccg cgt get ggc tct aga gcg eta cct cca 
His Gin Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 
115 120 125 

gga cca gcg ctt cct ggg cat gtg ggg ctg etc egg caa get gat ctg 
Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gin Ala Asp Leu 
130 135 140 

cac cac ggc cgt gee ctg gaa cgc cag ctg gag caa caa gaa cct gag 
His His Gly Arg Ala Leu Glu Arg Gin Leu Glu Gin Gin Glu Pro Glu 
145 150 155 160 

cca gat ttg gga caa cat gac ctg gat gga gtg gga gcg cga gat cag 
Pro Asp Leu Gly Gin His Asp Leu Asp Gly Val Gly Ala Arg Asp Gin 
165 170 175 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 
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caa eta cac cga gat cat eta cag cct gat cga gga gag cca gaa cca 

Gin Leu His Arg Asp His Leu Gin Pro Asp Arg Gly Glu Pro Glu Pro 
180 185 190 

gca gga gaa gaa cga get gga cct get cca get gga caa gtg ggc aag 

Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gin Val Gly Lys 
195 200 205 

ctt gtg tga ctg att gag gat cc 

Leu Val * Leu lie Glu Asp 
210 



<210> 66 
<211> 214 
<212> PRT 

<213> Artificial Sequence 
<400> 66 

Leu Glu Gin Arg Gin Gly Asp Phe Pro Pro Arg Arg Arg Arg His Ala 

15 10 15 

Arg Gin Leu Ala Gin Arg Ala Val Gin Val Gin Gly Gly Glu Asp Arg 

20 25 30 

Ala Pro Gly His Arg Pro His Gin Gly Gin Ala Pro Arg Gly Ala Ala 

35 40 45 

Arg Glu Ala Arg Arg Gly His Arg Arg Tyr Val Pro Arg Leu Pro Gly 

50 55 60 

Arq Cys Arg Gin His His Gly Arg Arg Gin Pro Asp Pro Asp Arg Ala 
65 70 75 80 

Gly Pro Pro Ala Ala Glu Arg His Arg Ala Ala Ala Glu Gin Pro Ala 

85 90 95 

Ala Arg His Arg Gly Pro Ala Ala Pro Ala Pro Ala Asp Arg Val Gly 

100 105 HO 

His Gin Ala Ala Pro Gly Pro Arg Ala Gly Ser Arg Ala Leu Pro Pro 

115 120 125 

Gly Pro Ala Leu Pro Gly His Val Gly Leu Leu Arg Gin Ala Asp Leu 

130 135 140 

His His Gly Arg Ala Leu Glu Arg Gin Leu Glu Gin Gin Glu Pro Glu 
145 150 155 160 

Pro Asp Leu Gly Gin His Asp Leu Asp Gly Val Gly Ala Arg Asp Gin 

165 170 175 

Gin Leu His Arg Asp His Leu Gin Pro Asp Arg Gly Glu Pro Glu Pro 

180 185 190 

Ala Gly Glu Glu Arg Ala Gly Pro Ala Pro Ala Gly Gin Val Gly Lys 

195 200 205 

Leu Val Leu He Glu Asp 
210 

<210> 67 
<211> 1918 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> {!)... (1918) 



<400> 67 

get age gcg gec gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 



48 
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96 



144 



192 



240 



288 



336 



384 



Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
15 10 15 

gtg tgg aag gac gcc acc acc acc ctg ttc tgc gcc age gac gec aag 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gcc tac gac acc gag gtg cac aac gtg tgg gcc acc cac gcg tgc gtg 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 
-Pro Thr Asp Pro Asn Pro Gin Glu Varl--Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 
He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 HO 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gcc ctg 432 
Asn He Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 
Phe Tyr Ser Leu Asp He Val Pro He Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate acc cag gcc tgc ccc aag 
Arg Leu Arg Ser Cys Asn Thr Ser He He Thr Gin Ala Cys Pro Lys 
165 170 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gcc ccc gcc ggc ttc 
Val Ser Phe Glu Pro He Pro He His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 

gcc ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 
Ala He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val 
210 215 220 

age acc cag ctg ctg ctg aac ggc age ctg gcc gag gag gag gtg gtg 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 



480 



528 



576 



624 



672 



720 
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ate aga tct gag aac ttc acc aac aac gec aag acc ate ate gtg cag 
He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 
Leu Asn Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 

cgc aag age ate cac ate ggc cct ggc cgc gec ttc tac acc acc ggc 
Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

Vac ate ate ggc gac ate cgc cag gcc-cac tgc aac ate tct aga acc 
Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 
290 295 300 

aac tgg acc aac acc ctg aag cgc gtg gee gag aag ctg cgc gag aag 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 
Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 

gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 
Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 350 

aac acc acc cag ctg ttc aac age acc tgg aac gag acc aac age gag 
Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 

ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc ate aag cag 
Gly Asn He Thr Ser Gly Thr He Thr Leu Pro Cys Arg lie Lys Gin 
370 375 380 

ate ate aac atg tgg cag gag gtg ggc aag gec atg tac gee ccc ccc 
He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc ctg ctg ctg 
He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 
405 410 415 

acc cgc gac ggc ggc age gac aac teg age age ggc aag gag att ttc 
Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu He Phe 
420 425 430 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc age gag ctg tac 134 4 

Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
435 440 445 

aag tac aag gtg gtg aag ate gag ccc ctg ggc ate gec ccc acc aag 1392 
Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly lie Ala Pro Thr Lys 
450 455 460 



gee aag cgc cgc gtg gtg cag cgc gag aag cgc gec gtg ggc ate ggc 
Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly lie Gly 
465 470 475 480 



1440 
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get atg ttc etc ggc ttc ctg ggc get gca ggc age ace atg ggc gec 1488 
Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
485 490 495 

gec age ctg acc ctg acc gtg cag gee cgc cag ctg ctg age ggc ate 1536 
Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly lie 
500 505 510 

gtg cag cag cag aac aac ctg ctg cgc gee ate gag gec cag cag cac 1584 
Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His 
515 . 520 525 

ctg etc cag ctg acc gtg tgg ggc ate aag cag etc cag gee cgc gtg 
Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val 
530 535 540 

ctg get eta gag cgc tac etc cag gac cag cgc ttc ctg ggc atg tgg 
Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg Phe Leu Gly Met Trp 
545 550 555 560 

ggc tgc tec ggc aag ctg ate tgc acc acg gec gtg ccc tgg aac gec 
Gly Cys Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala 
565 570 575 

age tgg age aac aag aac ctg age cag att tgg gac aac atg acc tgg 1776 
Ser Trp Ser Asn Lys Asn Leu Ser Gin He Trp Asp Asn Met Thr Trp 
580 585 590 



atg gag tgg gag cgc gag ate age aac tac acc gag ate ate tac age 
Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Glu He He Tyr Ser 
595 600 605 



etc cag ctg gac aag tgg gca age ttg tgt gac tga ttg agg ate c 
Leu Gin Leu Asp Lys Trp Ala Ser Leu Cys Asp + Leu Arg lie 
625 630 635 



<210> 68 
<211> 638 
<212> PRT 

<213> Artificial Sequence 
<400> 68 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

15 10 15 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 

35 40 45 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 

50 55 60 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp- 
65 70 75 80 

He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 



1632 



1680 



1728 



1824 



ctg ate gag gag age cag aac cag cag gag aag aac gag ctg gac ctg 1872 
Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 
610 615 620 



1918 
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Pro Leu Cys Val 
100 

Thr Asn Asn Thr 
115 

Asn He Ser Thr 
130 

Phe Tyr Ser Leu 
145 

Arg Leu Arg Ser 

Val . Ser Phe Glu 
180 

Ala He Leu Lys 
195 

Thr Asn Val Ser 
210 

Ser Thr Gin Leu 
225 

He Arg Ser Glu 

Leu Asn Glu Ser 
260 

Arg Lys Ser He 
275 

Asp He He Gly 
290 

Asn Trp Thr Asn 
305 

Phe Asn Asn Thr 

Glu He Val Met 
340 

Asn Thr Thr Gin 
355 

Gly Asn He Thr 
370 

He lie Asn Met 
385 

He Gly Gly Gin 

Thr Arg Asp Gly 
420 

Arg Pro Gly Gly 
435 

Lys Tyr Lys Val 
450 

Ala Lys Arg Arg 
465 

Ala Met Phe Leu 

Ala Ser Leu Thr 
500 

Val Gin Gin Gin 
515 

Leu Leu Gin Leu 
530 

Leu Ala Leu Glu 
545 



85 

Thr Leu Asn Cys 

Arg Trp Gly Thr 
120 

Ser Val Arg Asn 
135 

Asp lie Val Pro 
150 

Cys Asn Thr Ser 
165 

Pro He Pro lie 

Cys Asn Asn Lys 
200 

Thr Val Gin Cys 
215 

Leu Leu Asn Gly 
230 

Asn Phe Thr Asn 
245 

Val Glu lie Asn 

His He Gly Pro 
280 

Asp lie Arg Gin 
295 

Thr Leu Lys Arg 
310 

Thr lie Val Phe 
325 

His Ser Phe Asn 

Leu Phe Asn Ser 
360 

Ser Gly Thr lie 
375 

Trp Gin Glu Val 
390 

lie Lys Cys Leu 
405 

Gly Ser Asp Asn 

Gly Asp Met Arg 
440 

Val Lys He Glu 
455 

Val Val Gin Arg 
470 

Gly Phe Leu Gly 
485 

Leu Thr Val Gin 

Asn Asn Leu Leu 
520 

Thr Val Trp Gly 
535 

Arg Tyr Leu Gin 
550 



40 



90 

Thr Lys Leu Lys 
105 

Gin Glu Met Lys 

Lys Met Lys Arg 
140 

He Asp Asn Asp 
155 

lie lie Thr Gin 
170 

His Phe Cys Ala 
185"" 

Thr Phe Asn Gly 

Thr His Gly lie 
220 

Ser Leu Ala Glu 
235 

Asn Ala Lys Thr 
250 

Cys Thr Arg Pro 
265 

Gly Arg Ala Phe 

Ala His Cys Asn 
300 

Val Ala Glu Lys 
315 

Asn Gin Ser Ser 
330 

Cys Gly Gly Glu 
345 

Thr Trp Asn Glu 

Thr Leu Pro Cys 
380 

Gly Lys Ala Met 
395 

Ser Asn lie Thr 
410 

Ser Ser Ser Gly 
425 

Asp Asn Trp Arg 

Pro Leu Gly lie 
460 

Glu Lys Arg Ala 
475 

Ala Ala Gly Ser 
490 

Ala Arg Gin Leu 
505 

Arg Ala lie Glu 

lie Lys Gin Leu 
540 

Asp Gin Arg Phe 
555 
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95 

Asn Ser Thr Asp 
110 

Asn Cys Ser Phe 
125 

Glu Tyr Ala Leu 

Asn Thr Ser Tyr 
160 

Ala Cys Pro Lys 
175 

Pro Ala Gly Phe 
190 

Thr Gly Pro Cys 
205 

Arg Pro Val Val 

Glu Glu Val Val 
240 

lie lie Val Gin 
255 

Asn Asn Asn Thr 
270 

Tyr Thr Thr Gly 
285 

lie Ser Arg Thr 

Leu Arg Glu Lys 
320 

Gly Gly Asp Pro 
335 

Phe Phe Tyr Cys 
350 

Thr Asn Ser Glu 
365 

Arg lie Lys Gin 

Tyr Ala Pro Pro 
400 

Gly Leu Leu Leu 
415 

Lys Glu lie Phe 
430 

Ser Glu Leu Tyr 
445 

Ala Pro Thr Lys 

Val Gly He Gly 
480 

Thr Met Gly Ala 
495 

Leu Ser Gly lie 
510 

Ala Gin Gin His 
525 

Gin Ala Arg Val 

Leu Gly Met Trp 
560 
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Gly 


Cys 


Ser 


Gly 


Lys 


Leu 


He 


Cys 


Thr 


Thr 


Ala 


Val 


Pro 


Trp 


Asn 


Ala 










565 










570 










575 




Ser 


Trp 


Ser 


Asn 


Lys 


Asn 


Leu 


Ser 


Gin 


He 


Trp 


Asp 


Asn 


Met 


Thr 


Trp 








580 










585 










590 






Met 


Glu 


Trp 


Glu 


Arg 


Glu 


He 


Ser 


Asn 


Tyr 


Thr 


Glu 


He 


He 


Tyr 


Ser 






595 










600 










605 








Leu 


He 


Glu 


Glu 


Ser 


Gin 


Asn 


Gin 


Gin 


Glu 


Lys 


Asn 


Glu 


Leu 


Asp 


Leu 




610 










615 










620 










Leu 


Gin 


Leu 


Asp 


Lys 


Trp 


Ala 


Ser 


Leu 


Cys 


Asp 


Leu 


Arg 


He 







625 630 635 

<210> 69 

<211> 2071 

<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (1) . . . (2071) 
<400> 69 

get age gcg gec gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 4 8 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
1 5 10 15 

gtg tgg aag gac gec acc acc acc ctg ttc tgc gee age gac gec aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gee tac gac acc gag gtg cac aac gtg tgg gec acc cac gcg tgc gtg 144 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

•ate ate age ctg tgg gac cag age ctg aa'g ccc tgc gtg aag ctg acc 288 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 336 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 110 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 384 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gec ctg 432 
Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 



ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 
Phe Tyr Ser Leu Asp lie Val Pro lie Asp Asn Asp Asn Thr Ser Tyr 
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145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate ace cag gec tgc ccc aag 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gec ccc gec ggc ttc 
Val Ser Phe Glu Pro He Pro He His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 

gec ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 
Ala He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val 
210 215 220 

age acc cag ctg ctg ctg aac ggc age ctg gec gag gag gag gtg gtg 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

ate aga tct gag aac ttc acc aac aac gee aag acc ate ate gtg cag 
He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 
Leu Asn Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 

cgc aag age ate cac ate ggc cct ggc cgc gec ttc tac acc acc ggc 
Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

gac ate ate ggc gac ate cgc cag gee cac tgc aac ate tct aga acc 
Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 
290 295 300 

aac tgg acc aac acc ctg aag cgc gtg gec gag aag ctg cgc gag aag 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 
Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 

gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 
Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 350 

aac acc acc cag ctg ttc aac age acc tgg aac gag acc aac age gag 
Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 

ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc ate aag cag 
Gly Asn He Thr Ser Gly Thr He Thr Leu Pro Cys Arg lie Lys Gin 
370 375 380 

ate ate aac atg tgg cag gag gtg ggc aag gee atg tac gec ccc ccc 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 
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lie lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 

385 390 395 400 

ate ggc ggc cag ate aag tgc ctg age aac ate ace ggc ctg ctg ctg 

He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 

405 410 415 

ace cgc gac ggc ggc age gac aac teg age age ggc aag gag att ttc 

Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu He Phe 

420 425 430 



gec aag cgc cgc gtg gtg cag cgc gag aag cgc gec gtg ggc ate ggc 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly lie Gly 

465 470 475 480 

get atg ttc etc ggc ttc ctg ggc get gca ggc age acc atg ggc gee 

Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 

485 490 495 



gtg cag cag cag aac aac ctg ctg cgc gee ate gag gec cag cag cac 
Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His 
515 520 525 



ctg get eta gag cgc tac etc cag gac cag cgc ttc ctg ggc atg tgg 
Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg Phe Leu Gly Met Trp 
545 550 555 560 



atg gag tgg gag cgc gag ate age aac tac acc gag ate ate tac age 
Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Glu lie lie Tyr Ser 
595 600 605 



1248 



1296 



cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc age gag ctg tac 1344 
KTg Pro Gly Gly Gly Asp Met Arg Asp"7Vsh Trp Arg Ser Glu Leu Tyr 
435 440 445 

aag tac aag gtg gtg aag ate gag ccc ctg ggc ate gec ccc acc aag 1392 
Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly He Ala Pro Thr Lys 
450 455 460 



1440 



1488 



gec age ctg acc ctg acc gtg cag gee cgc cag ctg ctg age ggc ate 1536 
Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly He 
500 505 510 



1584 



ctg etc cag ctg acc gtg tgg ggc ate aag cag etc cag gee cgc gtg 1632 
Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val 
530 535 540 



1680 



ggc tgc tec ggc aag ctg ate tgc acc acg gee gtg ccc tgg aac gee 1728 

Gly Cys Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala 

565 570 575 

age tgg age aac aag aac ctg age cag att tgg gac aac atg acc tgg 1776 

Ser Trp Ser Asn Lys Asn Leu Ser Gin He Trp Asp Asn Met Thr Trp 

580 585 590 



1824 



ctg ate gag gag age cag aac cag cag gag aag aac gag ctg gac ctg 1872 
Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 
610 615 620 
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etc cag ctg gac aag tgg gca age ttg tgg aac tgg ttc aac ate acc 

Leu Gin Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn lie Thr 

625 630 635 640 

aac tgg ctg tgg tac ate aag att ttc ate atg ate gtg ggc ggc ctg 
Asn Trp Leu Trp Tyr He Lys He Phe He Met lie Val Gly Gly Leu 
645 650 655 

ate ggc ctg cgc ate gtg ttc acc gtg ctg age ate gtg aac cgc gtg 
He Gly Leu Arg He Val Phe Thr Val Leu Ser He Val Asn Arg Val 
660 665 670 

cgc cag gga tgc age ccc ctg age ttc cag acc cgc ctg ccc gtg tga 
Arg Gin Gly Cys Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Val * 
675 680 685 

egg ate c 
Arg He 



1920 



1968 



2016 



2064 



2071 



<210> 70 
<211> 689 
<212> PRT 

<213> Artificial Sequence 
<400> 70 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 

Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 

20 25 30 

Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 

35 40 45 

Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 

50 55 60 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 

100 105 HO 

Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 

115 120 125 

Asn He Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 

130 135 140 

Phe Tyr Ser Leu Asp He Val Pro He Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

Arg Leu Arg Ser Cys Asn Thr Ser He He Thr Gin Ala Cys Pro Lys 

165 170 175 

Val Ser Phe Glu Pro He Pro He His Phe Cys Ala Pro Ala Gly Phe 

180 185 190 

Ala He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 

195 200 205 

Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val 

210 215 220 

Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin 
245 250 255 
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Leu Asn Glu Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr 

260 265 270 

Arg Lys Ser lie His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 

275 280 285 

Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 

290 295 300 

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 

325 330 335 

Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 

340 345 350 

TTTn Thr Thr Gin Leu Phe Asn Ser TffFTrp Asn Glu Thr Asn Ser Glu 

355 360 365 

Gly Asn lie Thr Ser Gly Thr He Thr Leu Pro Cys Arg He Lys Gin 

370 375 380 

He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

lie Gly Gly Gin lie Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 

405 410 415 

Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu lie Phe 

420 425 430 

Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 

435 440 445 

Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly He Ala Pro Thr Lys 

450 455 460 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly lie Gly 
465 470 475 480 

Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 

485 490 495 

Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly lie 

500 505 510 

Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His 

515 520 525 

Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val 

530 535 540 

Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg Phe Leu Gly Met Trp 
545 550 555 560 

Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ala Val Pro Trp Asn Ala 

565 570 ' 575 

Ser Trp Ser Asn Lys Asn Leu Ser Gin lie Trp Asp Asn Met Thr Trp 

580 585 590 

Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Glu lie lie Tyr Ser 

595 600 605 

Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 

610 615 620 

Leu Gin Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn lie Thr 
625 630 635 640 

Asn Trp Leu Trp Tyr lie Lys lie Phe lie Met lie Val Gly Gly Leu 

645 650 655 

lie Gly Leu Arg lie Val Phe Thr Val Leu Ser lie Val Asn Arg Val 

660 665 670 

Arg Gin Gly Cys Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Val Arg 
675 680 685 

lie 



<210> 71 
<211> 2469 
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<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . . (2469) 

<400> 71 

get age gcg gec gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 

Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
15 10 15 

Tjrg tgg aag gac gec acc acc acc cttT'ttc tgc gec age gac gec aag 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gec tac gac acc gag gtg cac aac gtg tgg gec acc cac gcg tgc gtg 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 

ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 

aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 
lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 HO 

acc aac aac acc cgc tgg ggc acc cag gag atg aag aac tgc age ttc 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gee ctg 
Asn He Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 
Phe Tyr Ser Leu Asp He Val Pro He Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



cgc ctg cgc age tgc aac aca teg ate ate acc cag gee tgc ccc aag 528 

Arg Leu Arg Ser Cys Asn Thr Ser He He Thr Gin Ala Cys Pro Lys 

165 170 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gee ccc gec ggc ttc 576 

Val Ser Phe Glu Pro He Pro He His Phe Cys Ala Pro Ala Gly Phe 

180 185 190 



gee ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 
Ala He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 



624 
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acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 
210 215 220 

age acc cag ctg ctg ctg aac ggc age ctg gee gag gag gag gtg gtg 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

ate aga tct gag aac ttc acc aac aac gec aag acc ate ate gtg cag 
He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 
Leu Asn Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr 
260 265 270 

cgc aag age ate cac ate ggc cct ggc cgc gee ttc tac acc acc ggc 
Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 
275 280 285 

gac ate ate ggc gac ate cgc cag gee cac tgc aac ate tct aga acc 
Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 
290 295 300 

■ aac tgg acc aac acc ctg aag cgc gtg gee gag aag ctg cgc gag aag 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 
Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 

gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 
Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 350 

aac acc acc cag ctg ttc aac age acc tgg aac gag acc aac age gag 
Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 

ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc ate aag cag 
Gly Asn He Thr Ser Gly Thr He Thr Leu Pro Cys Arg He Lys Gin 
370 375 380 

ate ate aac atg tgg cag gag gtg ggc aag gee atg tac gee ccc ccc 
He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc ctg ctg ctg 
He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 
405 410 415 

acc cgc gac ggc ggc age gac aac teg age age ggc aag gag att ttc 
Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu He Phe 
420 425 430 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc age gag ctg tac 1344 
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
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435 440 445 

aag tac aag gtg gtg aag ate gag ccc ctg ggc ate gec ccc acc aag 
Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly He Ala Pro Thr Lys 
450 455 460 

gec aag cgc cgc gtg gtg cag cgc gag aag cgc gec gtg ggc ate ggc 
Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly He Gly 
465 470 475 480 

get atg ttc etc ggc ttc ctg ggc get gca ggc age acc atg ggc gec 
Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
485 490 495 



gtg cag cag cag aac aac ctg ctg cgc gec ate gag gec cag cag cac 
Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His 
515 520 525 



580 585 590 

atg gag tgg gag cgc gag ate age aac tac acc gag ate ate tac age 

Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Glu He He Tyr Ser 

595 600 605 



etc cag ctg gac aag tgg gca age ttg tgg aac tgg ttc aac ate acc 
Leu Gin Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr 
625 630 635 640 

aac tgg ctg tgg tac ate aag att ttc ate atg ate gtg ggc ggc ctg 
Asn Trp Leu Trp Tyr He Lys He Phe He Met lie Val Gly Gly Leu 
645 650 655 

ate ggc ctg cgc ate gtg ttc acc gtg ctg age ate gtg aac cgc gtg 
He Gly Leu Arg He Val Phe Thr Val Leu Ser He Val Asn Arg Val 
660 665 670 

cgc cag ggc tac age ccc ctg age ttc cag acc cgc ctg ccc gtg ccc 



1392 



1440 



1488 



gee age ctg acc ctg acc gtg cag gee cgc cag ctg ctg age ggc ate 1536 
Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly He 
500 505 510 



1584 



ctg etc cag ctg acc gtg tgg ggc ate aag cag etc cag gec cgc gtg 1632 
Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val 
530 535 540 



1680 



1728 



ctg get eta gag cgc tac etc cag gac cag cgc ttc ctg ggc atg tgg 

Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg Phe Leu Gly Met Trp 

545 550 555 560 

ggc tgc tec ggc aag ctg ate tgc acc acg gee gtg ccc tgg aac gec 

Gly Cys Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala 

565 570 575 

age tgg age aac aag aac ctg age cag att tgg gac aac atg acc tgg 1776 

Ser Trp Ser Asn Lys Asn Leu Ser Gin lie Trp Asp Asn Met Thr Trp 

cn^ QQc c,Qf) 



1824 



ctg ate gag gag age cag aac cag cag gag aag aac gag ctg gac ctg 1872 
Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Leu Asp Leu 
610 615 620 



1920 



1968 



2016 



2064 
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Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu Pro Val Pro 

675 680 685 

cgc ggc ccc gac cgc ccc gag ggc ate gag gag gag ggc ggc gag cgc 2112 

Arg Gly Pro Asp Arg Pro Glu Gly lie Glu Glu Glu Gly Gly Glu Arg 

690 695 *?00 



gac cgc gac cgc age acc cgc ctg gtg acc ggc ttc ctg ccc ctg ate 
Asp Arg Asp Arg Ser Thr Arg Leu Val Thr Gly Phe Leu Pro Leu lie 
705 710 715 720 

tgg gac gac ctg cgc age ctg ttc ctg ttc age tac cat cga ttg cgc 
¥*p Asp Asp Leu Arg Ser Leu Phe Leu-Phe Ser Tyr His Arg Leu Arg 
725 730 735 



ggc tgg gag ate ctg aag tac tgg tgg aac ctg etc cag tac tgg age 
Gly Trp Glu He Leu Lys Tyr Trp Trp Asn Leu Leu Gin Tyr Trp Ser 
755 760 765 



gec gtg gee gag ggc acc gac cgc gtg ate gag gtg gtg cag cgc ate 
Ala Val Ala Glu Gly Thr Asp Arg Val He Glu Val Val Gin Arg He 
785 790 795 800 

tgg cgc ggc ate ctg cac ate ccc acc cga att cgc cag ggc ttc gag 
Trp Arg Gly He Leu His He Pro Thr Arg He Arg Gin Gly Phe Glu 
805 810 815 

cgc gee ctg ctg taa gga tec 
Arg Ala Leu Leu * Gly Ser 
820 



<210> 72 
<211> 822 
<212> PRT 

<213> Artificial Sequence 



2160 



2208 



gac ctg ctg ctg ate gtg gee cgc ate gtg gag ctg ctg ggc egg cgc 2256 
Asp Leu Leu Leu He Val Ala Arg He Val Glu Leu Leu Gly Arg Arg 
740 745 750 



2304 



cag gag ctg aag aac tct gca gtg age ctg ctg aac gee acc gec ate 2352 
Gin Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr Ala He 
770 775 780 



2400 



2448 



2469 



<400> 72 



Ala 


Ser 


Ala 


Ala 


Asp 


Arg 


Leu 


Trp 


Val 


Thr 


Val 


Tyr 


Tyr 


Gly 


Val 


Pro 


1 








5 










10 










15 




Val 


Trp 


Lys 


Asp 


Ala 


Thr 


Thr 


Thr 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Lys 




20 










25 










30 






Ala 


Tyr 


Asp 


Thr 


Glu 


Val 


His 


Asn 


Val 


Trp 


Ala 


Thr 


His 


Ala 


Cys 


Val 






35 










40 










45 








Pro Thr Asp 


Pro 


Asn 


Pro 


Gin 


Glu 


Val 


Val 


Leu 


Gly 


Asn 


Val 


Thr 


Glu 




50 










55 










60 










Asn 


Phe 


Asn 


Met 


Gly 


Lys 


Asn 


Asn 


Met 


Val 


Glu 


Gin 


Met 


His 


Glu 


Asp 


65 










70 










75 










80 


He 


He 


Ser 


Leu 


Trp 


Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


Val 


Lys 


Leu 


Thr 










85 










90 










95 




Pro 


Leu 


Cys 


Val 


Thr 


Leu 


Asn 


Cys 


Thr 


Lys 


Leu 


Lys 


Asn 


Ser 


Thr 


Asp 
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100 

Thr Asn Asn Thr 
115 

Asn lie Ser Thr 
130 

Phe Tyr Ser Leu 
145 

Arg Leu Arg Ser 

Val Ser Phe Glu 
180 

Ala lie Leu Lys 
195 

Thr Asn Val Ser 
210 

Ser Thr Gin Leu 
225 

lie Arg Ser Glu 

Leu Asn Glu Ser 
260 

Arg Lys Ser He 
275 

Asp He He Gly 
290 

Asn Trp Thr Asn 
305 

Phe Asn Asn Thr 

Glu He Val Met 
340 

Asn Thr Thr Gin 
355 

Gly Asn He Thr 
370 

lie He Asn Met 
385 

He Gly Gly Gin 

Thr Arg Asp Gly 
420 

Arg Pro Gly Gly 
435 

Lys Tyr Lys Val 
450 

Ala Lys Arg Arg 
465 

Ala Met Phe Leu 

Ala Ser Leu Thr 
500 

Val Gin Gin Gin 
515 

Leu Leu Gin Leu 
530 

Leu Ala Leu Glu 
545 

Gly Cys Ser Gly 



Arg Trp Gly Thr 
120 

Ser Val Arg Asn 
135 

Asp lie Val Pro 
150 

Cys Asn Thr Ser 
165 

Pro lie Pro He 

Cys Asn Asn Lys 
200 

Thr Val Gin Cys 
215 

Leu Leu Asn Gly 
230 

Asn Phe Thr Asn 
245 

Val Glu He Asn 

His lie Gly Pro 
280 

Asp lie Arg Gin 
295 

Thr Leu Lys Arg 
310 

Thr He Val Phe 
325 

His Ser Phe Asn 

Leu Phe Asn Ser 
360 

Ser Gly Thr lie 
375 

Trp Gin Glu Val 
390 

lie Lys Cys Leu 
405 

Gly Ser Asp Asn 

Gly Asp Met Arg 
440 

Val Lys He Glu 
455 

Val Val Gin Arg 
470 

Gly Phe Leu Gly 
485 

Leu Thr Val Gin 

Asn Asn Leu Leu 
520 

Thr Val Trp Gly 
535 

Arg Tyr Leu Gin 
550 

Lys Leu lie Cys 
565 



105 

Gin Glu Met Lys 

Lys Met Lys Arg 
140 

lie Asp Asn Asp 
155 

He lie Thr Gin 
170 

His Phe Cys Ala 
185 

Thr Phe Asn Gly 

Thr His Gly He 
220 

Ser Leu Ala Glu 
235 

Asn Ala Lys Thr 
250 

Cys Thr Arg Pro 
265 

Gly Arg Ala Phe 

Ala His Cys Asn 
300 

Val Ala Glu Lys 
315 

Asn Gin Ser Ser 
•330 

Cys Gly Gly Glu 
345 

Thr Trp Asn Glu 

Thr Leu Pro Cys 
380 

Gly Lys Ala Met 
395 

Ser Asn lie Thr 
410 

Ser Ser Ser Gly 
425 

Asp Asn Trp Arg 

Pro Leu Gly He 
460 

Glu Lys Arg Ala 
475 

Ala Ala Gly Ser 
4 90 

Ala Arg Gin Leu 
505 

Arg Ala lie Glu 

He Lys Gin Leu 
540 

Asp Gin Arg Phe 
555 

Thr Thr Ala Val 
570 





110 






Asn Cys 


Ser 


Phe 


125 








Glu 


Tyr 


Ala 
Hid 


T .oil 


Asn 


Thr 


Ser 


Tyr 








160 


Ala 


Cys 


Pro 


Lys 






175 




Pro 


Ala 


Gly 


Phe 




190 






Thr Gly 


Pro 


Cys 


205 








Arg 


Pro 


Vd A 


vox 


Glu 


Glu 


Val 


Val 








240 


He 


He 


Val 


Gin 






255 




Asn 


Asn 


Asn 


Thr 




270 






Tyr 


Thr 


Thr 


Gly 


285 








lie 


Ser 


7\ V /T 


Thr 
x *ix 


Leu 


Arg 


Glu 


Lys 








320 


Gly Gly 


Asp 


Pro 






335 




Phe 


Phe 


Tyr 


Cys 




350 






Thr 


Asn 


Ser 


Glu 


365 








Arg 


lie 


Lys 


bin 


Tyr 


Ala 


Pro 


Pro 








400 


Gly 


Leu 


Leu 


Leu 






415 




Lys 


Glu 


He 


Phe 




430 






Ser 


Glu 


Leu 


Tyr 


445 








Ala 


Pro 


Thr 




Val 


Gly 


lie 


Gly 








480 


Thr 


Met 


Gly 


Ala 






495 




Leu 


Ser 


Gly 


lie 




510 






Ala 


Gin 


Gin 


His 


525 








Gin 


Ala 


Arg 


Val 


Leu 


Gly 


Met 


Trp 








560 


Pro 


Trp 


Asn 


Ala 






575 
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Trp 


OCX 


fien 
noli 

580 

■J O V 


T .v^ 
xj y o 






Ser 


Gin 
585 


He 


Trn 


Asp 


Asn 


Met 
590 


Thr 


TrD 


Met 

1 ITS L. 


U JL U 


i rp 


Glu 

u x u 


Arn 


Glu 

w J» w 


He 

JL JV w 


Ser 
600 


Asn 


Tyr 


Thr 


Glu 


He 
605 


He 


Tvr 


Ser 


Leu 


He 


Glu 


Glu 


Ser 


Gin 


Asn 
615 


Gin 


Gin 


Glu 


Lys 


Asn 
620 


Glu 


Leu 


Asp 


Leu 


T .on 


Gin 

will 


Xf c u 




1 jV^ 
jlj y o 


Trn 
i xp 


Ala 


Ser 


Leu 


Trp 


Asn 


TrD 


Phe 


Asn 


He 


Thr 












6*30 










635 










640 


Asn 


Trp 


Leu 


Trp 


Tyr 
fid s 


He 


Lys 


He 


Phe 


He 
650 


Met 


He 


Val 


Gly 


Gly 
655 


Leu 


He 


Gly 


Leu 


Arg 


He 


Val 


Phe 


Thr 


Val 


Leu 


Ser 


He 


Val 


Asn 


Arg 


Val 
















665 










670 






A&q 


Gin 


Gly 


Tyr 


Ser 


Pro 


Leu 


Ser 


Phe-Glfi 


Thr 


Arg 


Leu 


Pro 


Val 


Pro 
















6R0 










685 

V V 








Arg 


Gly 
can 


Pro 


Asp 


Arg 


Pro 


Glu 


Gly 


He 


Glu 


Glu 


Glu 
700 


Gly 


Gly 


Glu 


Arg 


A or** 
Mo 


/ii. y 


A An 


r\L y 




Thr 

1 1 1 X 


xAi y 


T .on 

JjC Li 


Val 


Thr 


Gl v 


Phe 




Pro 


Tipn 


He 






















715 










720 


Trp 


Asp 


Asp 


Leu 


Arg 
7?S 


Ser 


Leu 


Phe 


Leu 


Phe 
730 


Ser 


Tyr 


His 


Arg 


Leu 
735 


Arg 


Asp 


Leu 


Leu 


Leu 
740 


He 


Val 


Ala 


Ara 


He 
745 


Val 


Glu 


Leu 


Leu 


Gly 
ISO 


Arq 


Arq 


Gly 


Trp 


Glu 
755 


He 


Leu 


Lys 


Tyr 


Trp 
760 


Trp 


Asn 


Leu 


Leu 


Gin 
765 


Tyr 


Trp 


Ser 


Gin 


Glu 
770 


Leu 


Lys 


Asn 


Ser 


Ala 
775 


Val 


Ser 


Leu 


Leu 


Asn 
780 


Ala 


Thr 


Ala 


He 


Ala 


Val 


Ala 


Glu 


Gly 


Thr 


Asp 


Arg 


Val 


He 


Glu 


Val 


Val 


Gin 


Arg 


He 


785 










790 










795 










800 


Trp 


Arg 


Gly 


He 


Leu 
805 


His 


He 


Pro 


Thr 


Arg 
810 


He 


Arg 


Gin 


Gly 


Phe 
815 


Glu 


Arg 


Ala 


Leu 


Leu 


Gly 


Ser 























820 

<210> 73 
<211> 1431 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)...<1431) 

<400> 73 

get age gcg gec gac cgc ctg tgg gtg acc gtg tac tac ggc gtg ccc 48 
Ala Ser Ala Ala Asp Arg Leu Trp Val Thr Val Tyr Tyr Gly Val Pro 
15 10 15 

gtg tgg aag gac gec acc acc acc ctg ttc tgc gec age gac gec aag 96 
Val Trp Lys Asp Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
20 25 30 

gec tac gac acc gag gtg cac aac gtg tgg gee acc cac gcg tgc gtg 144 
Ala Tyr Asp Thr Glu Val His Asn Val Trp Ala Thr His Ala Cys Val 
35 40 45 



ccc acc gac ccc aac ccc cag gag gtg gtg ctg ggc aac gtg acc gag 192 
Pro Thr Asp Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu 
50 55 60 
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288 



336 



384 



432 



aac ttc aac atg ggc aag aac aac atg gtg gag cag atg cac gag gat 240 
Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

ate ate age ctg tgg gac cag age ctg aag ccc tgc gtg aag ctg acc 
He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

ccc ctg tgc gtg acc ctg aac tgc acc aag ctg aag aac age acc gac 
Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 
100 105 HO 

*cc aac aac acc cgc tgg ggc acc cag-gag atg aag aac tgc age ttc 
Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 
115 120 125 

aac ate age acc age gtg cgc aac aag atg aag cgc gag tac gee ctg 
Asn He Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 
130 135 140 

ttc tac age ctg gac ate gtg ccc ate gac aac gac aac acc age tac 480 
Phe Tyr Ser Leu Asp lie Val Pro He Asp Asn Asp Asn Thr Ser Tyr 
145 150 155 160 

cgc ctg cgc age tgc aac aca teg ate ate acc cag gec tgc ccc aag 528 
Arg Leu Arg Ser Cys Asn Thr Ser lie lie Thr Gin Ala Cys Pro Lys 
165 170 175 

gtg age ttc gag ccc ate ccc ate cac ttc tgc gee ccc gee ggc ttc 576 
Val Ser Phe Glu Pro He Pro lie His Phe Cys Ala Pro Ala Gly Phe 
180 185 190 

gec ate ctg aag tgc aac aac aag acc ttc aac ggc acc ggc ccc tgc 
Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 
195 200 205 

acc aac gtg age acc gtg cag tgc acc cac gga att cgc ccc gtg gtg 
Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val 
210 215 220 

age acc cag ctg ctg ctg aac ggc age ctg gee gag gag gag gtg gtg 
Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 



624 



672 



720 



ate aga tct gag aac ttc acc aac aac gee aag acc ate ate gtg cag 768 

He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr lie lie Val Gin 
245 250 255 

ctg aac gag age gtg gag ate aac tgc acc cgc ccc aac aac aac acc 816 

Leu Asn Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr 

260 265 270 

cgc aag age ate cac ate ggc cct ggc cgc gec ttc tac acc acc ggc 864 

Arg Lys Ser lie His lie Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 

275 280 285 

gac ate ate ggc gac ate cgc cag gec cac tgc aac ate tct aga acc 912 

Asp lie lie Gly Asp lie Arg Gin Ala His Cys Asn lie Ser Arg Thr 

290 295 300 
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aac tgg acc aac acc ctg aag cgc gtg gcc gag aag ctg cgc gag aag 
Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

ttc aac aac acc acc ate gtg ttc aac cag age tec ggc ggc gac ccc 
Phe Asn Asn Thr Thr lie Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 
325 330 335 

gag ate gtg atg cac age ttc aac tgc ggc ggc gag ttc ttc tac tgc 
Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 
340 345 350 

aac acc acc cag ctg ttc aac age acc tgg aac gag acc aac age gag 
Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 
355 360 365 

ggc aac ate act agt ggc acc ate acc ctg ccc tgc cgc ate aag cag 
Gly Asn He Thr Ser Gly Thr He Thr Leu Pro Cys Arg He Lys Gin 
370 375 380 

ate ate aac atg tgg cag gag gtg ggc aag gcc atg tac gcc ccc ccc 
He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 



acc cgc gac ggc ggc age gac aac teg age age ggc aag gag att ttc 
Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu He Phe 
420 425 430 



aag tac aag gtg gtg aag ate gag ccc ctg ggc ate gcc ccc acc aag 
Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly He Ala Pro Thr Lys 
450 455 460 



960 



1008 



1056 



1104 



1152 



1200 



ate ggc ggc cag ate aag tgc ctg age aac ate acc ggc ctg ctg ctg 1248 
He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 
405 410 415 



1296 



cgc ccc ggc ggc ggc gac atg cgc gac aac tgg cgc age gag ctg tac 134 4 
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
435 440 445 



1392 



gcc aag cgc cgc gtg gtg cag cgc gag aag cgc gcc tag 14 31 

Ala Lys' Arg Arg Val Val Gin Arg Glu Lys Arg Ala * 
465 470 475 



<210> 74 
<211> 476 
<212> PRT 

<213> Artificial Sequence 
<400> 74 

Ala Ser Ala Ala Asp Arg Leu Trp 

1 5 
Val Trp Lys Asp Ala Thr Thr Thr 
20 

Ala Tyr Asp Thr Glu Val His Asn 

35 40 
Pro Thr Asp Pro Asn Pro Gin Glu 



Val Thr Val Tyr Tyr Gly Val Pro 

10 15 
Leu Phe Cys Ala Ser Asp Ala Lys 
25 30 
Val Trp Ala Thr His Ala Cys Val 
45 

Val Val Leu Gly Asn Val Thr Glu 
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50 55 60 

Asn Phe Asn Met Gly Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
65 70 75 80 

He He Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 

85 90 95 

Pro Leu Cys Val Thr Leu Asn Cys Thr Lys Leu Lys Asn Ser Thr Asp 

100 105 HO 

Thr Asn Asn Thr Arg Trp Gly Thr Gin Glu Met Lys Asn Cys Ser Phe 

115 120 125 

Asn lie Ser Thr Ser Val Arg Asn Lys Met Lys Arg Glu Tyr Ala Leu 

130 135 140 

Phe Tvr Ser Leu Asp He Val Pro He Asp Asn Asp Asn Thr Ser Tyr 
*45 150 155 160 

Arg Leu Arg Ser Cys Asn Thr Ser He lie Thr Gin Ala Cys Pro Lys 

165 170 175 

Val Ser Phe Glu Pro He Pro He His Phe Cys Ala Pro Ala Gly Phe 

180 185 190 

Ala He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys 

195 200 205 

Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val 

210 215 220 

Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val 
225 230 235 240 

He Arg Ser Glu Asn Phe Thr Asn Asn Ala Lys Thr He lie Val Gin 

245 250 255 

Leu Asn Glu Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr 

260 265 270 

Arg Lys Ser He His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly 

275 280 285 

Asp He He Gly Asp He Arg Gin Ala His Cys Asn He Ser Arg Thr 

290 295 300 

Asn Trp Thr Asn Thr Leu Lys Arg Val Ala Glu Lys Leu Arg Glu Lys 
305 310 315 320 

Phe Asn Asn Thr Thr He Val Phe Asn Gin Ser Ser Gly Gly Asp Pro 

325 330 335 

Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys 

340 345 350 

Asn Thr Thr Gin Leu Phe Asn Ser Thr Trp Asn Glu Thr Asn Ser Glu 

355 360 365 

Gly Asn He Thr Ser Gly Thr He Thr Leu Pro Cys Arg He Lys Gin 

370 375 380 

He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
385 390 395 400 

He Gly Gly Gin He Lys Cys Leu Ser Asn He Thr Gly Leu Leu Leu 

405 410 415 

Thr Arg Asp Gly Gly Ser Asp Asn Ser Ser Ser Gly Lys Glu He Phe 

420 425 430 

Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 

435 440 445 

Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly He Ala Pro Thr Lys 

450 455 460 

Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala 
465 470 475 

<210> 75 

<211> 1038 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<221> CDS 

<222> (I),.. (1038) 

<400> 75 

gcc gtg ggc ate ggc get atg ttc etc ggc ttc ctg ggc get gca ggc 

Ala Val Gly lie Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly 

15 10 15 

age ace atg ggc gcc gcc age ctg ace ctg acc gtg cag gcc cgc cag 
Ser Thr Met Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg Gin 
20 25 30 



48 



96 



ctg ctg age ggc ate gtg cag cag cag aac aac ctg ctg cgc gcc ate 14 4 

Leu Leu Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie 
35 40 45 

gag gcc cag cag cac ctg etc cag ctg acc gtg tgg ggc ate aag cag 192 
Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin 
50 55 60 

etc cag gcc cgc gtg ctg get eta gag cgc tac etc cag gac cag cgc 
Leu Gin Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Gin Asp Gin Arg 
65 70 75 80 

ttc ctg ggc atg tgg ggc tgc tec ggc aag ctg ate tgc acc acg gcc 
Phe Leu Gly Met Trp Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ala 

85 90 95 

gtg ccc tgg aac gcc age tgg age aac aag aac ctg age cag att tgg 
Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Asn Leu Ser Gin lie Trp 
100 105 HO 

gac aac atg acc tgg atg gag tgg gag cgc gag ate age aac tac acc 
Asp Asn Met Thr Trp Met Glu Trp Glu Arg Glu lie Ser Asn Tyr Thr 
115 120 125 

gag ate ate tac age ctg ate gag gag age cag aac cag cag gag aag 
Glu lie lie Tyr Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys 
130 135 140 

aac gag ctg gac ctg etc cag ctg gac aag tgg gca age ttg tgg aac 
Asn Glu Leu Asp Leu Leu Gin Leu Asp Lys Trp Ala Ser Leu Trp Asn 
145 150 155 160 

tgg ttc aac ate acc aac tgg ctg tgg tac ate aag att ttc ate atg 
Trp Phe Asn He Thr Asn Trp Leu Trp Tyr He Lys He Phe He Met 
165 170 175 

ate gtg ggc ggc ctg ate ggc ctg cgc ate gtg ttc acc gtg ctg age 
He Val Gly Gly Leu He Gly Leu Arg He Val Phe Thr Val Leu Ser 
180 185 190 

ate gtg aac cgc gtg cgc cag ggc tac age ccc ctg age ttc cag acc 
He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr 
195 200 205 

cgc ctg ccc gtg ccc cgc ggc ccc gac cgc ccc gag ggc ate gag gag 
Arg Leu Pro Val Pro Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 
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210 215 220 

gag ggc ggc gag cgc gac cgc gac cgc age acc cgc ctg gtg acc ggc 

Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Thr Arg Leu Val Thr Gly 

225 230 235 240 

ttc ctg ccc ctg ate tgg gac gac ctg cgc age ctg ttc ctg ttc age 

Phe Leu Pro Leu lie Trp Asp Asp Leu Arg Ser Leu Phe Leu Phe Ser 

245 250 255 

tac cat cga ttg cgc gac ctg ctg ctg ate gtg gec cgc ate gtg gag 

Tyr His Arg Leu Arg Asp Leu Leu Leu lie Val Ala Arg lie Val Glu 

260 265 270 



720 



768 



816 



ctg ctg ggc egg cgc ggc tgg gag ate ctg aag tac tgg tgg aac ctg 864 
Leu Leu Gly Arg Arg Gly Trp Glu lie Leu Lys Tyr Trp Trp Asn Leu 
275 280 285 

etc cag tac tgg age cag gag ctg aag aac tct gca gtg age ctg ctg 912 
Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala Val Ser Leu Leu 
290 295 300 

aac gec acc gec ate gec gtg gee gag ggc acc gac cgc gtg ate gag 960 
Asn Ala Thr Ala He Ala Val Ala Glu Gly Thr Asp Arg Val He Glu 
305 310 315 320 

gtg gtg cag cgc ate tgg cgc ggc ate ctg cac ate ccc acc cga att 1008 
Val Val Gin Arg He Trp Arg Gly He Leu His He Pro Thr Arg He 
325 330 335 

cgc cag ggc ttc gag cgc gee ctg ctg taa 1038 
Arg Gin Gly Phe Glu Arg Ala Leu Leu * 
340 345 



<210> 76 

<211> 345 

<212> PRT 

<213> Artificial Sequence 



<400> 76 



Ala 


Val 


Gly 


He 


Gly. 


Ala 


Met 


Phe 


Leu Gly 


Phe 


Leu 


Gly 


Ala 


Ala 


Gly 


1 






5 








10 










15 




Ser 


Thr 


Met 


Gly 


Ala 


Ala 


Ser 


Leu 


Thr Leu 


Thr 


Val 


Gin 


Ala 


Arg 


Gin 








20 










25 








30 






Leu 


Leu 


Ser 


Gly 


He 


Val 


Gin 


Gin 


Gin Asn 


Asn 


Leu 


Leu 


Arg 


Ala 


He 






35 










40 








45 








Glu 


Ala 


Gin 


Gin 


His 


Leu 


Leu 


Gin 


Leu Thr 


Val 


Trp 


Gly 


He 


Lys 


Gin 




50 










55 








60 










Leu 


Gin 


Ala 


Arg 


Val 


Leu 


Ala 


Leu 


Glu Arg 


Tyr 


Leu 


Gin 


Asp 


Gin 


Arg 


65 










70 








75 










80 


Phe 


Leu 


Gly 


Met 


Trp 


Gly 


Cys 


Ser 


Gly Lys 


Leu 


He 


Cys 


Thr 


Thr 


Ala 










85 








90 










95 




Val 


Pro 


Trp Asn 


Ala 


Ser 


Trp 


Ser 


Asn Lys 


Asn 


Leu 


Ser 


Gin 


He 


Trp 








100 










105 








110 






Asp Asn 


Met 


Thr 


Trp 


Met 


Glu 


Trp Glu Arg 


Glu 


He 


Ser 


Asn 


Tyr 


Thr 






115 










120 








125 








Glu 


He 


He 


Tyr 


Ser 


Leu 


He 


Glu 


Glu Ser 


Gin 


Asn 


Gin 


Gin 


Glu 


Lys 



130 135 140 
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Asn 


Glu 


Leu 


Asp 


Leu 


Leu 


Gin 


Leu 


145 










150 






Trp 


Phe 


Asn 


He 


Thr 


Asn 


Trp 


Leu 










165 








lie 


Val 


Gly 


Gly 


Leu 


He 


Gly 


Leu 








180 










He 


Val 


Asn 


Arg 


Val 


Arg 


Gin 


Gly 






195 










200 


Arg 


Leu 


Pro 


Val 


Pro 


Arg 


Gly 


Pro 




210 










215 




Glu Gly 


Gly 


Glu 


Arg 


Asp 


Arg 


Asp 


225 










230 








Leu 


Pro 


Leu 


He 


Trp 


Asp 


Asp 










245 








Tyr 


His 


Arg 


Leu 


Arg 


Asp 


Leu 


Leu 








260 










Leu 


Leu 


Gly 


Arg 


Arg 


Gly 


Trp 


Glu 






275 










280 


Leu 


Gin 


Tyr 


Trp 


Ser 


Gin 


Glu 


Leu 




290 










295 




Asn 


Ala 


Thr 


Ala 


lie 


Ala 


Val 


Ala 


305 










310 






Val 


Val 


Gin 


Axg 


He 


Trp 


Arg 


Gly 










325 








Arg 


Gin 


Gly 


Phe 


Glu 


Arg 


Ala 


Leu 



340 



57 



Asp 


Lys 


Trp 
155 


Ala 


Ser 


Leu 


Trp 


Asn 
160 


Trp 


Tyr 
170 


He 


Lys 


He 


Phe 


He 
175 


Met 


Arg 


He 


Val 


Phe 


Thr 


Val 


Leu 


Ser 


185 










190 






Tyr 


Ser 


Pro 


Leu 


Ser 
205 


Phe 


Gin 


Thr 


Asp 


Arg 


Pro 


Glu 
220 


Gly 


He 


Glu 


Glu 


Arg 


Ser 


Thr 
235 


Arg 


Leu 


Val 


Thr 


Gly 
240 


Leu— Arg 


Ser 


Leu 


Phe 


Leu 


Phe 


Ser 




250 










255 




Leu 


He 


Val 


Ala 


Arg 


He 


Val 


Glu 


265 










270 






He 


Leu 


Lys 


Tyr 


Trp 
285 


Trp 


Asn 


Leu 


Lys 


Asn 


Ser 


Ala 
300 


Val 


Ser 


Leu 


Leu 


Glu 


Gly 


Thr 
315 


Asp 


Arg 


Val 


He 


Glu 
320 


He 


Leu 
330 


His 


He 


Pro 


Thr 


Arg 
335 


He 


Leu 
















345 

















